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Abstract 

In this paper we introduce a synthetic notion of Riemannian Ricci bounds from below 
£^ , for metric measure spaces (X, d,m) which is stable under measured Gromov-Hausdorff 

^ ' convergence and rules out Finsler geometries. It can be given in terms of an enforcement 

, of the Lott, Sturm and Villani geodesic convexity condition for the entropy coupled with 

the linearity of the heat flow. Besides stability, it enjoys the same tensorization, global- 
£h to-local and local-to-global properties. In these spaces, that we call RCD(K, oo) spaces, 

we prove that the heat flow (which can be equivalently characterized either as the flow 
, associated to the Dirichlet form, or as the Wasserstein gradient flow of the entropy) 

' satisfies Wasserstein contraction estimates and several regularity properties, in particular 

Bakry-Emery estimates and the L°° — Lip Feller regularization. We also prove that the 
distance induced by the Dirichlet form coincides with d, that the local energy measure 
has density given by the square of Checger's relaxed slope and, as a consequence, that 
• ' the underlying Brownian motion has continuous paths. All these results are obtained 

independently of Poincare and doubling assumptions on the metric measure structure and 
I ■ therefore apply also to spaces which are not locally compact, as the infinite-dimensional 

7—1 \ ones. 
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1 Introduction 

The problem of finding synthetic notions of Ricci curvature bounds from below has been 
a central object of investigation in the last few years. What became clear over time (see 
in particular [15] and [9, Appendix 2]), is that the correct class of spaces where such a 
synthetic notion can be given, is that of metric measure spaces, i.e. metric spaces equipped 
with a reference measure which one might think of as volume measure. The goal is then to 
find a notion consistent with the smooth Riemannian case, which is sufficiently weak to be 
stable under measured Gromov-Hausdorff limits. The problem of having stability is of course 
in competition with the necessity to find a condition as restrictive as possible, to describe 
efficiently the closure of the class of Riemannian manifolds with Ricci curvature uniformly 
bounded from below. 

In their seminal papers Lott-Villani [27] and Sturm [39] independently attacked these ques- 
tions with tools based on the theory of optimal transportation, devising stable and consistent 
notions. In these papers, a metric measure space (X, d,m) is said to have Ricci curvature 
bounded from below by K G R (in short: it is a CD(K, oo) space) if the relative entropy 
functional 



is K geodesically convex on the Wasserstein space (&2(X), VF2). 

Also, in [27], [40] a synthetic notion CD(K, N) of having Ricci curvature bounded from 
below by K and dimension bounded above by N was given (in [27] only the case CD(0,N) 
was considered, for N < oo), and a number of geometric consequences of these notions, like 
Brunn-Minkowski and Bishop-Gromov inequalities, have been derived. In [26] it was also 
proved that, at least under the nonbranching assumption, the CD(K, N) condition implies 
also the Poincare inequaliy, see also [33] for some recent progress in this direction. 

An interesting fact, proved by Cordero-Erasquin, Sturm and Villani (see the conclusions 
of [43]), is that M. d equipped with any norm and with the Lebesgue measure, is a CD(0,N) 
space. More generally, Ohta showed in [29] that any smooth compact Finsler manifold is 
a CD(K, N) space for appropriate finite K,N. However, a consequence of the analysis of 
tangent spaces done in [9], is that a Finsler manifold arises as limit of Riemannian manifolds 




with /i = pm 
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with Ricci curvature uniformly bounded below and dimension uniformly bounded from above, 
if and only if it is Riemannian (the case of possibly unbounded dimension of the approximating 
sequence is covered by the stability of the heat flow proved in [18] in conjunction with the 
fact that the heat flow on a Finsler manifold is linear if and only if it is Riemannian [30]). 

Therefore it is natural to look for a synthetic and stable notion of Ricci curvature bound 
which rules Finsler spaces out. This is the scope of this paper. What we do, roughly said, 
is to add to the CD(K, oo) condition the linearity of the heat flow, see below for the precise 
definition. 

Before passing to the description of the results of this paper, we recall the main results of 
the "calculus" in the first paper of ours [2], needed for the development and the understanding 
of this one. The main goals of [2] have been the identification of two notions of gradient and 
two gradient flows. 

The first notion of gradient, that we call minimal relaxed gradient and denote by |V/|*, is 
inspired by Cheeger's work [8]: it is the local quantity that provides integral representation 
to the functional Ch(/) given by 

Ch(/) :=iinf jhminf J |VM 2 dm: f h £ Lip(X), jf \f h - f\ 2 dm -> fjj (1.1) 

(here | Vfh\ is the so-called local Lipschitz constant of fh), so that Ch(/) = \ f W f\1 dm. The 
second notion of gradient, that we call minimal weak upper gradient and denote by [V/l^, 
is, instead, inspired by Shanmugalingam's work [36] and based on the validity of the upper 
gradient property 

|/(7i)-/(7o)|< / |V/U 

on "almost all" curves 7. We proved that the minimal weak upper gradient and the minimal 
relaxed gradient coincide. In addition, although our notions of null set of curves differs 
from [36] and the definition of Ch differs from [8], we prove, a posteriori, that the gradients 
coincide with those in [8], [36]. Since an approximation by Lipschitz functions is implicit in 
the formulation (1.1), this provides a density result of Lipschitz function in the weak Sobolev 
topology without any doubling and Poincare assumption on (X, d,m). In the context of 
the present paper, where Ch will be a quadratic form even when extended to m-measurable 
functions using weak upper gradients, this approximation result yields the density of Lipschitz 
functions in the strong Sobolev topology. 

The concept of minimal relaxed gradient can be used in connection with "vertical" variations 
of the form e 1— > f + eg, which occur in the study of the L 2 (X, m)-gradient flow of Ch, whose 
semigroup we shall denote by Hj. On the other hand, the concept of minimal weak upper 
gradient is relevant in connection with "horizontal" variations of the form t *— > /("ft), which 
play an important role when study the derivative of Ent m along geodesies. For this reason 
their identification is crucial, as we will see in Section 4. Given this identification for granted, 
in the present paper most results will be presented and used at the level of minimal weak 
upper gradients, in order to unify the exposition. 

Finally, in CD(K, 00) spaces we identified the L 2 -gradient flow Ht of Ch (in the sense of 
the Hilbertian theory [6]) with the Vi^-gradient flow of Ent m in the Wasserstein space of 
probability measures &2{X) (in the sense of De Giorgi's metric theory, see [3] and, at this 
level of generality, [18]), which we shall denote by Mi- A byproduct of this identification is an 
equivalent description of the entropy dissipation rate along the flow, equal to 4 J | V-y/Hi/'l^ dm 
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and to the square of the metric derivative of fit = J%t(fm) w.r.t. W2. 

All these results have been obtained in [2] under very mild assumptions on m, which include 
all measures such that e _cd ( x > x °)x(i is finite for some c > and xq S X. In this paper, in order 
to minimize the technicalities, we assume that m is a probability measure with finite second 
moment. On the other hand, no local compactness assumption on (X, d) will be needed, so 
that infinite-dimensional spaces fit well into this theory. 

Coming back to this paper, we say that a metric measure space (X, d, m) has Riemannian 
Ricci curvature bounded from below by K G R, and write that (X, d,m) is RCD(K, 00), if 
one of the following equivalent conditions hold: 

(i) (X, d,tti) is a strong CD(K, 00) space and the W/Vgradient flow of Ent m is additive. 

(ii) (X, d,m) is a strong CD(K, 00) space and Ch is a quadratic form in L 2 (X, m), so that 
the L 2 -heat flow of Ch is linear. 

(iii) The gradient flow of Ent m exists for all initial data fi with supp fi C suppm and satisfies 
the EVIk condition. 

The equivalence of these conditions is not at all obvious, and its proof is actually one of the 
main results of this paper. 

Observe that in (i) and (ii) the CD(K, 00) is enforced on the one hand considering a 
stronger convexity condition (we describe this condition in the end of the introduction, being 
this aspect less relevant), on the other hand adding linearity of the heat flow. A remarkable 
fact is that this combination of properties can be encoded in a single one, namely the EVIx 
property. This latter property can be expressed by saying that for all v £ ^2 (A) with finite 
entropy the gradient flow M[(fi) starting from \x satisfies 

d ^m.i/) + K w ^ M ^ v) +Entin( ^ (/U)) < Entm(l/) for ae t G (0;Oo) _ (L2) 

It is immediate to see that the RCD(K, 00) notion is consistent with the Riemannian case: 
indeed, uniqueness of geodesies in (£p2(M), W2) between absolutely continuous measures and 
the consistency of the CD(K, 00) notion, going back to [11, 42], yield that manifolds are strong 
CD(K, 00) spaces (see below for the definition), and the fact that Ch is quadratic is directly 
encoded in the Riemannian metric tensor, yielding the linearity of the heat flow. On the other 
hand, the stability of RCD(K, 00) bounds with respect to the measured Gromov-Hausdorff 
convergence introduced by Sturm [39] is a consequence, not too difficult, of condition (iii) 
and the general stability properties of EVI/^ flows (see also [34, 35] for a similar statement). 
We remark also that thanks to the results in [20, 29, 32, 44], compact and finite-dimensional 
spaces with Alexandrov curvature bounded from below are RCD(K, 00) spaces. 

Besides this, we prove many additional properties of RCD(K, 00) spaces. Having (1.2) 
at our disposal at the level of measures, it is easy to obtain fundamental solutions, in- 
tegral representation formulas, regularizing and contractivity properties of the heat flow, 
which exhibits a strong Feller regularization from L°°(X, m) to Lipschitz. Denoting by 
W l > 2 (X, d, m) C L 2 (X, m) the finiteness domain of Ch, the identification of the L 2 -gradient 
flow of Ch and of the W2-g r adient flow of Ent m in conjunction with the if-contractivity of 
W2 along the heat flow, yields, as in [19], the Bakry-Emery estimate 

\V(H t f)\l<e- 2Kt H t (\Vf\ 2 w ) m-a.e. in X 
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for all / S W 1,2 (X, d, m). As a consequence of this, we prove that functions / whose minimal 
weak upper gradient (V/l^, belongs to L°°(X, m) have a Lipschitz version /, with Lip(/) < 

HIV/ML- 

In connection with the tensorization property, namely the stability of RCD(K, oo) metric 
measure spaces with respect to product (with squared product distance given by the sum of the 
squares of the distances in the base spaces) , we are able to achieve it assuming that the base 
spaces are nonbranching. This limitation is due to the fact that also for the tensorization 
of CD(K, oo) spaces the nonbranching assumption has not been ruled out so far (see [39, 
Theorem 4.17]). On the other hand, we are able to show that the linearity of the heat 
flow tensorizes, when coupled just with the strong CD(K, oo) condition. The nonbranching 
assumption on the base spaces could be avoided with a proof of the tensorization property 
directly at the level of EVI^-, but we did not succeed so far in tensorizing the EVI# condition. 

Since Ch is a quadratic form for RCD(K, oo) spaces, it is tempting to take the point of 
view of Dirichlet forms and to describe the objects appearing in Fukushima's theory [16] of 
Dirichlet forms. In this direction, see also the recent work [22] and Remark 6.6. Independently 
of any curvature bound we show that, whenever Ch is quadratic, a Leibnitz formula holds 
and there exists a "local" bilinear map (/,<?) V/ ■ Vg from [VF 1,2 (X, d, m)] 2 to L X (X, m), 
that provides an integral representation to the Dirichlet form E(u,v) associated to Ch. This 
allows us to show that the local energy measure [u] of Fukushima's theory coincides precisely 
with [Vu^m. If the space is RCD(K, oo) then the intrinsic distance dg, associated to the 
Dirichlet form by duality with functions u satisfying [u] < m is precisely d. The theory of 
Dirichlet forms can also be applied to obtain the existence of a continuous Brownian motion 
in RCD(K, oo) spaces, i.e. a Markov process with continuous sample paths and transition 
probabilities given by J%t(5 x ). 

Besides the extension to more general classes of reference measures m, we believe that this 
paper opens the door to many potential developments: among them we would like to mention 
the dimensional theory, namely finding appropriate "Riemannian" versions of the CD(K, N) 
condition, and the study of the tangent space. In connection with the former question, since 
CD(K, N) spaces are CD(K, oo), a first step could be analyzing them with the calculus 
tools we developed and to see the impact of the linearity of the heat flow and of the EVI^ 
condition stated at the level of Ent m . Concerning the latter question, it is pretty natural to 
expect RCD(K, oo) spaces to have Hilbertian tangent space for m-a.e. point. While the proof 
of this result in the genuine infinite dimensional case seems quite hard to get, if the space is 
doubling and supports a local Poincare inequality, one can hope to refine Cheeger's analysis 
([8, Section 11]) in order to achieve it. 

The paper is organized as follows. In Section 2 we introduce our main notation and 
the preliminary results needed for the development of the paper. With the exception of 
Section 2.4, where we quote from [2] the basic results we already alluded to, namely the 
identification of weak gradients and relaxed gradients and the identification of L 2 -gradient 
flow of Ch and WVgradient flow of Ent m , the material is basically known. Particularly relevant 
for us will be the EVI^ formulation of gradient flows, discussed in Section 2.5. 

In Section 3 we introduce a convexity condition, that we call strong CD(K, oo), interme- 
diate between the CD(K, oo) condition, where convexity is required along some geodesic, and 
convexity along all geodesies. It can be stated by saying that, given any two measures, there 
is always an optimal geodesic plan 7r joining them such that the .ff-convexity holds along all 
the geodesies induced by weighted plans of the form Ftv, where F is a bounded, non negative 
function with J F d7v = 1. We also know from [13] that the EVI^ condition implies convexity 
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along all geodesies supported in suppm, and therefore the strong CD(K, oo) property. 
This enforcement of the CD(K, oo) condition is needed to derive strong L°° bounds on the 
interpolating measures induced by the "good" geodesic plan. These "good" interpolating 
measures provide large class of test plans and are used to show, in this framework, that 
the "metric Brenier" theorem [2, Theorem 10.3] holds. Roughly speaking, this theorem 
states that, when one transports in an optimal way fi to v , the transportation distance 
d (70,71) depends 7r-almost surely only on the initial point 70 (and in particular it is inde- 
pendent on the final point 71). Furthermore, the proof of this result provides the equality 
d(7o,7i) = |V + 9j|(7o) = |V<^| w (7i) for 7r-a.e. 7, where ip is any Kantorovich potential relative 
to (p, v). This equality will be crucial for us when proving optimal bounds for the derivative 
of Ent m along geodesies. 

In Section 4 we enter the core of the paper with two basic formulas, one for the derivative 
of the Wasserstein distance along the heat flow (ptvn) (Theorem 4.1), obviously important for a 
deeper understanding of (1.2), the other one for the derivative of the entropy (Theorem 4.8) 
along a geodesic p s = p s m. The proof of the first one uses the classical duality method 
and relates the derivative of W^iptm, v) to the "vertical" derivative of the density pt in the 
direction given by Kantorovich potential from ptm to v. The second one involves much 
more the calculus tools we developed. The key idea is to start from the (classical) convexity 
inequality for the entropy, written in terms of the optimal geodesic plan tv from p^m. to pitn 

Ent m (p s m) - Ent m (p tn) > J log p (p s - p ) dm = J (log p (j s ) - log p (7o)) dvr(7) (1.3) 

and then use the crucial Lemma 4.5, relating the "horizontal" derivatives appearing in (1.3) 
to the "vertical" ones. In section 4.3 the same lemma, applied to suitable plans generated 
by the heat flow, is the key to deduce a local quadratic structure from a globally quadratic 
Cheeger energy and to develop useful calculus tools, leading in particular to the identification 
of |Vu|^m with the energy measure [u] provided by the general theory of Dirichlet forms. 

Section 5 is devoted to the proof of the equivalence of the three conditions defining 
RCD(K, 00) spaces, while the final Section 6 treats all properties of RCD(K, 00) spaces 
we already discussed: representation, contraction, and regularizing properties of the heat 
flow, relations with the theory of Dirichlet forms and existence of the Brownian motion, sta- 
bility, tensorization. We also discuss, in the last section, the so-called global-to-local and 
local-to-global implications. We prove that the first one always holds if the subset under 
consideration is convex, with positive m-measure and m-negligible boundary. We also prove a 
partial result in the other direction, from local to global, comparable to those available within 
the CD(K, 00) theory. 

Acknowledgement. The authors acknowledge the support of the ERC ADG GeMeThNES. 

2 Preliminaries 

2.1 Basic notation, metric and measure theoretic concepts 

Unless otherwise stated, all metric spaces (Y, dy) we will be dealing with are complete and 
separable. Given a function E : Y — > R U {±00}, we shall denote its domain {y : E{y) € R} 
by D(E). The slope (also called local Lipschitz constant) |V.E|(y) of E at y £ D{E) is defined 
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\VE\(y) : = limsup ^M^M . (2.1) 

By convention we put \VE\(y) = +00 if y ^ D(E) and |Vi?|(y) = if y G D(E) is isolated. 

We shall also need the one-sided counterparts of this concept, namely the descending 
slope (in the theory of gradient flows) and the ascending slope (in the theory of Kantorovich 
potentials). Their are defined at y G D(E) by 

\V-E\(y) := Ito^fiM-EM)-, |V*JS|<») := limsup " E <" ))+ 



dy(y,^) z-^ dy(z,y) 



with the usual conventions if either y is isolated or it does not belong to D{E). 

We will denote by C([0, 1];Y) the space of continuous curves on (Y, dy); it is a complete 
and separable metric space when endowed with the sup norm. We also denote with ej : 
C([0, 1]; Y) ->• Y, t G [0, 1], the evaluation maps 

et(7):=7t V 7 G C([0, 1]; Y). 

A curve 7 : [0, 1] — > Y is said to be absolutely continuous if 

dy( 7 t,7s)< f 9(r)dr Vs, t G [0, 1], s < t, (2.2) 

J s 

for some g G L 1 (0, 1). If 7 is absolutely continuous, the metric speed | 7 | : [0,1] — > [0, 00] is 
defined by 

171 • h™ \h\ 

and it is possible to prove that the limit exists for a.e. t, that | 7 | G L 1 (0, 1), and that it is the 
minimal L 1 function (up to Lebesgue negligible sets) for which the bound (2.2) holds (see [3, 
Theorem 1.1.2] for the simple proof). 

We shall denote by AC 2 ([0, 1];Y) the class of absolutely continuous curves with metric 
derivative in L 2 (0, 1); it is easily seen to be a countable union of closed sets in C([0, 1];Y) 
and in particular a Borel subset. 

A curve 7 G C([0, 1]; Y) is called constant speed geodesic if dy (7t,7s) = \t — s|dy (70,71) 
for all s, t G [0, 1]. We shall denote by Geo(Y) the space of constant speed geodesies, which 
is a closed (thus complete and separable) subset of C([0, 1]; Y). 

(Y, dy) is called a length space if for any yo, yi G Y and e > there exists 7 G AC([0, 1]; Y) 
such that 



7o = yo, 7i =2/1 and Length^) := / \j t \ dt < dy (y , yt) + e. (2.3) 

J 

If for any yo, y\ G Y one can find 7 satisfying (2.3) with e = (and thus, up to a reparam- 
eterization, 7 G Geo(Y)), we say that (Y, dy) is a geodesic space. We also apply the above 
definitions to (even non closed) subsets Z C Y, always endowed with the distance dy induced 
by Y. It is worth noticing that if Z is a length space in Y, then Z is a length space in Y as 
well [7, Ex. 2.4.18]. 

We use standard measure theoretic notation, as Cb(X) for bounded continuous maps, /j 
for the push forward operator induced by a Borel map /, namely f$n(A) := /x(/~ 1 (^4)), /j,\_A 
for the restriction operator, namely fi\_A(B) = fj,(A n .B). 
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2.2 Reminders on optimal transport 

We assume that the reader is familiar with optimal transport, here we just recall the notation 
we are going to use in this paper and some potentially less known constructions. Standard 
references are [1, 3, 43] and occasionally we give precise references for the facts stated here. 

Given a complete and separable space (X, d), 3 > 2(X) is the set of Borel probability mea- 
sures with finite second moment, which we endow with the Wasserstein distance W2 defined 
by 

Wf 0, v) := min J d 2 (x, y) d 7 (x, y), (2.4) 

the minimum being taken among the collection Adm(/x, v) of all admissible plans (also called 
couplings) 7 from \i to v, i.e. all measures 7 E 3 s {X. x X) such that irl^f = /i, 7r 2 7 



v . 



All the minimizers of (2.4) are called optimal plans and their collection (always non empty, 
since (i, v & 3 > 2(X)) is denoted by Opt(//, v). The metric space (3 } 2(X), W2) is complete 
and separable; it is also a length or a geodesic space if and only if X is, see for instance [1, 
Theorem 2.10, Remark 2.14]. 

Given a reference measure m, we shall also use the notation 

^P^m) := {n E 3> 2 (X) : supp/U C suppm}. 

The c-transform of a function ip : X — > R U {—00}, relative to the cost c = ^d 2 , is defined by 

Notice that still ip c takes its values in R U {—00}, unless ip = —00. A function ip : X — > 
R U {—00} is said to be c-concave if <p = ip c for some i/):l-flU {—00}. A set rdxX 
is c-cyclically monotone if 

n n 

}^ c(xj, yj) < yjc(xj,y CTffl ) V?i > 1, (xi,yi) E T, a permutation. 

i=l i=l 

Given /i, v E &2 (X) there exists a c-cyclically monotone closed set T containing the support 
of all optimal plans 7. In addition, there exists a (possibly non unique) c-concave function 
ip E L 1 {X,ij) such that ip c E L l {X,v) and f{x) + <p c (y) = c(x,y) on F. Such functions are 
called Kantorovich potentials. We remark that the typical construction of cp (see for instance 
[3, Theorem 6.1.4]) gives that <p is locally Lipschitz in X if the target measure v has bounded 
support. Conversely, it can be proved that 7 E Adm(yU, u) and supp7 c-cyclically monotone 
imply that 7 is an optimal plan. 

It is not hard to check that (see for instance [2, Proposition 3.9]) 

\V + ip\(x) < d(x,y) for 7-a.e. (x,y), (2.5) 

for any optimal plan 7 and Kantorovich potential (p from fi to v. 

If fi and v are joined by a geodesic in {^2 (X), W2), the distance W2 can be equivalently 
characterized by 

W$(ji,v) = mmj J | 7t | 2 dtd7 r ( 7 ), (2.6) 
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among all measures 7r G ^ 2> (C([0, 1]; X)) such that (eo)^7r = fx, (ei)j7r = u, where the 2- 
action f |7t| 2 dt is taken by definition +00 is 7 is not absolutely continuous. The set of 
minimizing plans 7r in (2.6) will be denoted by GeoOpt(/x, v). It is not difficult to see that 
7r G GeoOpt(yU, u) if and only if 7 := (eo, ei)jj7r G &{X x X) is a minimizer in (2.4) and tv is 
concentrated on Geo(X). Furthermore, a curve (/it) is a constant speed geodesic from /to to 
,ui if and only if there exists n G GeoOpt(/zo, Pi) such that 

/i t = (et)jj7r Vi G [0, 1], 

see for instance [1, Theorem 2.10] and notice that the assumption that (X, d) is geodesic is 
never used in the proof of (i) 44> (ii). 

The linearity of the transport problem immediately yields that the squared Wasserstein 
distance VF 2 2 (-,-) is jointly convex. This fact easily implies that if (pi), (p 2 ) C &<i{X) are 
two absolutely continuous curves, so is t 1— > pt '■= (1 — typt + ^Mt f° r an y ^ £ [0? 1]> with an 
explicit bound on its metric speed: 

lAtl 2 < (1 - A)|/it| 2 + X\p 2 \ 2 for a.e. t G [0, 1]. (2.7) 

Finally, we recall the definition of push forward via a plan, introduced in [39] (with a 
different notation) and further studied in [18], [2]. 

Definition 2.1 (Push forward via a plan) Let 7 G &*(X x Y). For p G 3^{X) such that 
p = p (tt? 7) <C ^flj the push forward -y^p G ^(K) 0/ /i wa 7 is defined by 

TjfM : = 7r J / '((P 07rX )7)- 
An equivalent representation of 7jj/x is 

7^ = ^7Tj y 7 where r](y) := p(x)drf y (x) (2.8) 

and {7 y }yey C ^*(X) is the disintegration of 7 w.r.t. the projection on Y. 

Defining 7" 1 := (7r Y ,7r x )^ G &*(Y x X), we can define in a symmetric way the map 
v 1 — y lj lv £ &(X) for any 1/ <C ttJj^ 1 = 71T7. 

Notice that if 7 is concentrated on the graph of a map T : X — > Y, it holds 7^ = Tjj/t for 
any /i <C ^7, and that typically 7^ 1 (7jj/i) 7^ /t- We collect in the following proposition the 
basic properties of 7^ in connection with the Wasserstein distance. 

Proposition 2.2 The following properties hold: 

(i) p < C K^~f for some C > implies *y^p < C 7^7. 

(lij Let p, v & and 7 G Opt(/i, i/). T/ien /or every /t G suc/i i/iat p <^ p it 

holds 

W%{p, 1 fp) = Jd 2 (x,y)^(x)d 1 (x,y) (2.9) 

and, in particular, 7^ G ^(^O and ^ o 7r 1 7 G Opt(/2, 7jj/i) i/ aay 0/ i/ie too terms is 
finite. 
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(Hi) Let 7 G ^ 2 {X x Y), C> and A c := {/j G ^> 2 P0 : H < CttJt}- ^/jen 

/i I—)- 7|/x is uniformly continuous in Ac w.r.t. the W2 distances. (2-10) 

(iv) Let 7 G &2(X x X) and /i < C 7^7 /or some constant C . Then 

W%(p, 7^) <C J d 2 (x, y) d 7 (x, y). (2.11) 

Proof, (i) is obvious. 

(ii) Since 7 is optimal, supp o 7r 1 7) C supp7 is c-cyclically monotone. Moreover ^ o 7r 1 7 
is an admissible plan from p to 7fl/i, with cost equal to the right hand side of (2.9). Hence, 
if the cost is finite, from the finiteness of W2{p,~i$p) we infer that *y^p G & 2 {X), hence 
c-cyclical monotonicity implies optimality and equality in (2.9). The same argument works if 
we assume that W2{p,~{$p) is finite. 

(iii) Since the singleton {71^7} is both tight and 2-uniformly integrable, the same is true for 
the set Ac, which, being WVclosed, is compact (see [3, Section 5.1] for the relevant definitions 
and simple proofs). Hence it is sufficient to prove the continuity of the map. Let (p n ) C Ac 
be W2-converging to p G Ac and let p n , p be the respective densities w.r.t. ^7. Since (p n ) 
converges to p in duality with Cb(X) and since the densities are equibounded, we get that p n 
converge to p weakly* in L°°(X, irj 7). By (i) and the same argument just used we know that 
(7(|Mn) C &2(Y) is relatively compact w.r.t. the Wasserstein topology, hence to conclude it 
is sufficient to show that (y^p n ) converges to -y^p in duality with Cb(Y). To this aim, fix 
(f G Cf,(y) and notice that it holds 

<p(y) d7|jMn(y) = f x Y d 7(^ y) = J x viv) ^ixiyn Pn(x) d^jix), 

where {~y x } is the disintegration of 7 w.r.t. the projection on the first component. Since (p is 
bounded, so is the map x i->- J ^pdj x , and the claim follows. 

(iv) Just notice that o 7r 1 7 G Adm(p, 1^)- D 

The operation of push forward via a plan has also interesting properties in connection 
with the relative entropy functional Ent m . We recall that, given m G &(X), the functional 
Ent m : @>(X) -> [0, 00] is defined by 



Ent m (/i) := < 



^log ) 



+00 otherwise. 
Proposition 2.3 For all 7 G 3?{X x Y) the following properties hold: 

t-y,r 



(i) For any m, p <C ^fl it holds Ent 7(tm (7jj/i) < Ent m (//). 



is convex in 



(ii) For any m <C ^7, C > 0, the map p 1— > Ent m (/i) — Ent 7t)m (7jj//) 
{p G &>(X) : p < Cm}. 

Proof, (i) We follow [1, Lemma 7.4] and [39, Lemma 4.19]. We can assume p <C m, otherwise 
there is nothing to prove. Then it is immediate to check from the definition that 7^ <C 7jjm. 
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Let p = pm, m = 6n^y, y^/j, = rjj^m, and e(z) := zlogz. By disintegrating 7 as in (2.8), we 
have that 

V(y) = J p(x)dj y (x), j y = (^j 9{x)dj y (x)^j 6 7 y . 
Using the convexity of e and Jensen's inequality with the probability measures jy we get 

e(v(y)) < J e(p(x))d-y y (x). 

Now, since 7 y are the conditional probability measures of 7 := [1*1/71-^7] o ir x ~f = 60 Tr Xi y, 
whose first marginal is m, by integration of both sides with respect to the second marginal of 
7, namely 7jjTn, we get 



Ent Tttro (7(j/i) = / e(r ? (y))d7jm(y)< J J e(p(x))dj y (x)dj t m(y) 

e(p(x)) dj(x, y) = / e(p)dm. 



(ii) This is proved in [2, Lemma 7.7] (see also [18, Proposition 11]). Notice that in [2] we 
worked under the assumption X = Y . but this makes no difference, since the as one can work 
on the disjoint union X UY endowed with a distance which extends those of X, Y. □ 

Remark 2.4 We remark that the property (i) above is true for any internal energy kind 
functional: as the proof shows, under the same assumptions on m, p,, 7 it holds 

U Jim (l$p) < U m (p), 

where U m (p) is given by j «(p) dm + u'(oo)p s (X) for some convex continuous function U : 
[0, 00) -^KU {+00} and p = pm + p s , with p s _L m. 

On the other hand, part (ii) does not always hold for these functionals: in [18] it has been 
shown that for U(z) := ^— j- one has that 

p ^ U m (p) - C/ 7t)m (7 t) /i) 

is convex on {p S £P(X) : p < Cm} for any C > if and only if 1 < a < 2. In particular, 
convexity does not hold for the functionals appearing in the definition of CD(K, N) bounds. 



2.3 Metric measure spaces and Sturm's distance © 

Throughout this paper we will always consider normalized metric measure spaces with finite 
variance, according to [39, §3.1]: in short, we will denote by X the set of (isomorphism classes 
of) metric measure spaces that we will consider, namely 

X := |(X,d,m) : (X,d) is complete, separable, and m £ ^PO}- ( 2 - 12 ) 

We say that the two metric measure spaces (X, dx,nix) and (Y, dy,my) are isomorphic if 
there exists a 

bijective isometry / : supp — > supp my such that f^mx = vn.y (2-13) 
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We say that (X, d,m) is length or geodesic if (suppm, d) is so, and these notions are invariant 
in the isomorphism class. 

Notice that (X, d, m) is always isomorphic to (suppm, d,m), so that it will often be not 
restrictive to assume the non-degeneracy condition suppm = X. 

In this section we recall the definition of the distance B between metric measure spaces, 
introduced by Sturm in [39], and its basic properties. 

Definition 2.5 (Coupling between metric measure spaces) Given two metric measure 
spaces (X,dj,mx), (Y, dy,trty), we consider the product space (X x Y, dxy), where dxy is 
the distance defined by 

dxr((xi,yi), (x 2 ,y 2 )) := \J d x (x 1 ,x 2 ) + d Y {yi,y 2 ), 

We say that a pair (d.,7) is an admissible coupling between (X, dx,TtXx), (Y,dy,my), and we 
write (d,7) G Adm((dx, m x ), (dy , my)), if: 

(a) d is a pseudo distance on X U Y (i.e. points at d-distance are not necessarily 
equal) which coincides with dx (resp. dy) when restricted to suppmx x suppmx (resp. 
suppmy x suppmyj. 

(b) 7 is a Borel measure on X xY such that tt^T = mx and nY~y = my. 

It is not hard to see that the set of admissible couplings is always non empty. Notice that 
the restriction of d to X x Y is Lipschitz continuous and therefore Borel (with respect to the 
product topology), as a simple application of the triangle inequality. 
The cost C(d,7) of a coupling is given by 

C(d, 7 ) := / d 2 (x,y)d~f(x,y). 

JXxY 

In analogy with the definition of W 2 , the distance B((X, dx, mx), {Y, dy , my)) is then defined 

as 

B 2 ((X,dx,mx),OMy,my)) :=infC(d, 7 ), (2.14) 

the infimum being taken among all couplings (d, 7 ) of (X, dx,rnx) an d (Y, dy,my). Since 
one can use the isometries in (2.13) to transfer couplings between two spaces to couplings 
between isomorphic spaces, a trivial consequence of the definition is that B actually depends 
only on the isomorphism class. In the next proposition we collect the main properties of B, 
see [39, Section 3.1]. 

Proposition 2.6 (Properties of B) The infimum in (2.14) is attained and a minimizing 
coupling will be called optimal. Also, B is a distance on X, and in particular B vanishes only 
on pairs of isomorphic metric measure spaces. 
Finally, (X, B) is a complete, separable and length metric space. 

Also, it can be shown [39, Lemma 3.7] that B metrizes the measured Gromov-Hausdorff 
convergence, when restricted to compact metric spaces with controlled diameter. We also 
remark that, in line with what happens with the Gromov-Hausdorff distance, a B-convergent 
sequence of metric measure spaces can be embedded into a common metric space: in this case 
the possibility to work in spaces where suppm is not equal to the whole space X turns out 
to be useful. 
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Proposition 2.7 Let (X n , d n , m n ) G X, n G N, and (X, d,m) £ X. T/ien i/ie following two 
properties are equivalent. 

(i) (X n ,d n ,m n ) A (X,d,m) as n -> oo. 

fn) There exist a complete and separable metric space (Y, dy) and isometrics f n : suppm n — > 
Y, n G N, / : suppm — > Y, such that W2({f n )^n, itJ m ) — ^ as n — > oo. 
Proof. (i) (if). Let (d n ,7 n ) be optimal couplings for (X, d,m), (X n ,d n ,m n ), n G N. 
Define Y := ^ (J^ suppm^ U suppm and the pseudo distance dy on Y by 

if x, x' G suppm U suppm n , 
if x G suppm n , x' G suppm m . 



dy(x,x') := 



d n (x, x ) , 
inf d n (x, x") + d m (x", x') 



By construction the quotient metric space (Y, dy) induced by the equivalence relation x ~ 
y <^ dy(x,y) = is separable. Possibly replacing it by its abstract completion we can also 
assume that it is complete. Denoting by f n , f the isometric embeddings of X n , X into Y, 



B((X,d,m),(X n ,d n ,m„)) 

so the conclusion follows. 
(ii) =^ (i). Straightforward. 



xux„ 



& 2 n {x,y)di n {x,y) > W 2 ((f n )tm n , f t m), 



□ 



2.4 Calculus and heat flow in metric measure spaces 
2.4.1 Upper gradients 

Recall that the slope of a Lipschitz function ip is an upper gradient, namely |"0(Ti) — VKto)! 
can be bounded from above by J | V?/>| for any absolutely continuous curve 7 : [0, 1] — > X. 

Lemma 2.8 Let (X,d,m) G X and tp : X -)• R Lipschitz. For all fi t G AC 2 ([0, 1]; &i{X)) it 
holds ^ 

J x ^^l~ _| V»d/io < J [J^\V^\ 2 dfi t y /2 \fi t \dt. (2.15) 

Proof. Applying [25], we can find a probability measure 7r in C([0, 1];-X") concentrated on 
AC 2 ([0, 1]; X) = and satisfying 

Mt = (e t ) B 7r for aUte [0,1], |At| 2 = ^ It*| 2 dvr( T ) for a.e. t G (0, 1). (2.16) 
By the upper gradient property of |V^| we get 

J tpdfjti- J ipdfi = J(tpoei - ipoe ))d7v <J(J | V-0 1 (Tt ) I Tt I d*) d 







|V^|(7t)l7t|d7r(7))dt 



I • |V^|%t)d7r( 7 ) 



1/2 



1/2 

|7t| 2 d7r( 7 )) dt 



jf ( J |V^| 2 d/xt) 1/2 |/i*|dt. 



□ 
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2.4.2 Weak upper gradients and gradient flow of Cheeger's energy 

Here we recall the definition and basic properties of weak upper gradients of real functions in 
the metric measure space (X, d , m) . All the concepts and statements that we consider here 
have been introduced and proven in [2], see § 5. In particular, here we shall consider measures 
concentrated in AC 2 ([0, l\; X) (see § 2.1). 

Definition 2.9 (Test plans and negligible collection of curves) We say that 

7r G <^(AC 2 ([0, 1];X)) is a test plan with bounded compression if there exists C = C(tt) > 

such that 

(et)jj7r < Cm for every t G [0, 1]. 

We will denote by T the collection of all the test plans with bounded compression. We say 
that a Borel set A C AC 2 ([0, 1];-X") is T-negligible if it {A) = for any test plan iv G T. 

Since we will always deal with test plans in T, we will often omit to mention exsplicitly T and 
the words "bounded compression" , and we will refer to them simply as test plans. 

A property which holds for every curve of AC 2 ([0, 1];X), except possibly for a subset of 
a negligible set, is said to hold for almost every curve. 

Definition 2.10 (Functions which are Sobolev along almost all curves) We say that 
f : X — > R is Sobolev along almost all curves if, for a.e. curve 7, / o 7 coincides a.e. in [0, 1] 
and in {0, 1} with an absolutely continuous map / 7 : [0, 1] — > R. 

Notice that the choice of the trivial test plan 7r := ijjtn, where i : X — > AC 2 ([0, 1];X) 
maps any point x G X to the constant curve 7 = x, yields that any Sobolev function along 
almost all curves is finite m-a.e. in X. In this class of functions we can define the notion of 
weak upper gradient and of minimal weak upper gradient. 

Definition 2.11 (Weak upper gradients) Given f : X — > R Sobolev along a.e. curve, a 
m-measurable function G : X — > [0, 00] is a weak upper gradient of f if 







for a.e. curve 7. 


(2.17) 




Jr 1 







Here and in the following, we write J Qry f for /(71) - /(70) and J^G for J Q G(7 t )|7 t |df. 

It turns out (see [2, Proposition 5.7, Definition 5.9]) that if and G±, Gi are weak upper 
gradients of /, then so is minjGijG^}. It follows that there exists a m-measurable function 
jV/jt,, : X [0,oo] weak upper gradient having the property that 

\Vf\ w < G m-a.e. in X 

for any other weak upper gradient G. Because of this m-a.e. minimality property, the function 
|V/|u, will be called the minimal weak upper gradient of /. Also, the property of being Sobolev 
along a.e. curve and the minimal weak upper gradient are invariant under modifications of / 
in m-negligible sets ([2, Proposition 5.8]). In addition, the minimal weak gradient is local in 
the following sense: if both /, g are Sobolev along a.e. curve then it holds 

|V/|™ = \Vg\ w m-a.e. on the set {/ = g}. (2.18) 
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Other useful and natural properties are: the restriction inequality [2, Remark 5.6] 

1/(7.) - /(7-)l < J" |V/U(7r)|7r| dr for a.e. 7 , for all [s,t] C [0, 1] (2.19) 

the chain rule [2, Proposition 5.14(b)]) 

|V(0o f)\ w = (p' o f\Vf\ w m-a.e. in X, if 4> is Lipschitz and nondecreasing, ^ 
I V(0 o < |0' o /| | V/|u, m-a.e. in X, if <f> is Lipschitz, 

and the weak Leibnitz rule 

\V(fg)\ w <\f\\Vg\ w + \g\\Vf\ w m-a.e. in X. (2.21) 

The Cheeger energy is the functional defined in the class of Borel functions / : X — > M by 

' 1 f 

- / |V/| 2 dm if / is Sobolev along 

+oo otherwise. 



Ch(/) := 



a.e. curve, 



Using the stability properties of weak upper gradients under weak convergence ([2, Theo- 
rem 5.12]) it can be proved that Ch is convex and lower semicontinuous w.r.t. convergence 
in m-measure (in particular w.r.t. m-a.e. convergence). For the domain of Ch in L 2 (X, m) 
we shall also use the traditional notation W 1 ' 2 (X,d,m), see [2, Remark 4.7]: it is a Banach 
space when endowed with the norm ||/||^/i,2 := + 2Ch(/). A nontrivial approximation 
theorem (see [2, Theorem 6.2]) shows that 

Ch(/) = iinf jliminf j |VA| 2 dm : f h G Lip(X), \\f h - f\\ 2 -+ oj V/ € L 2 (X,m), 

(2.22) 

where |V/| is the local Lipschitz constant of / defined in (2.1). 

Given / 6 W 1 ' 2 (X, d,m), we write d~Ch(f) C L 2 (X,m) for the subdifferential at / of the 
restriction to L 2 (X, m) of Cheeger's energy, namely £ G <9~Ch(/) iff 



Ch( 5 ) > Ch(/) + / £(g - f) dm V 5 G L 2 (X, 
Jx 



m . 



We say that / G L 2 (X,m) is in the domain of the (d, m)-Laplacian, and write / G D(A) (in 
[2] we used the notation A^m to emphasize the dependence on the metric measure structure), 
if d~Ch(f) ^ 0. In this case we define A/ G L 2 (X,m) by Af := —v, where v is the element 
of minimal L 2 (X, m) norm in d~Ch(f). 

We remark that in this generality Ch is not necessarily a quadratic form, which is the 
same as to say that its restriction to L 2 (X, m) is not a Dirichlet form. This means that the 
Laplacian, though 1-homogeneous [2, Remark 4.14], is not necessarily linear. 

For the Laplacian we just defined the following rough integration by parts formula holds: 

J ffA/dm < J \Vg\ w \Vf\ w dm, (2.23) 

for all f,g£ L 2 (X,m) with / G D(A) and g G D(Ch), see [2, Proposition 4.15]. 

The following result is a consequence of the by now classical theory of gradient flows of 
convex lower semicontinuous functionals on Hilbert spaces. 
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Theorem 2.12 (Gradient flow of Ch in L 2 (X, m)) For all f G L 2 (X,m) there exists a 
unique locally absolutely continuous curve (0, oo) 3 t i— > ft G L 2 (X,m) such that ft f in 
L 2 (X, m) as 1 10 and 

^f t G -9-Ch(/0 for a.e. t > 0, 

the derivative being understood in L 2 (A, m). This curve is also locally Lipschitz, it satisfies 
ft G D(A) for any t > and 

^f t = Af t Vt > 0. 

Finally, t \— > Ch(/<) is locally Lipschitz in (0, oo), infinitesimal at oo and, if f & D(Ch), 
continuous in 0. /is ng/zi derivative is given by — ||A/t||| /or every t > 0. 

Finally, we recall a property of the minimal weak gradient of Kantorovich potentials [2, 
Lemma 10.1]: 

Proposition 2.13 Let (X, d,m) G X, /x, z/ G &%{X) with \i > cm /or some c > and Zei 97 

6e a Kantorovich potential relative to (fj,,u). Then 99 is ^niie and absolutely continuous (in 
particular, Sobolev) along a.e. curve and 

IV^lio < |V + 9j| m-a.e. inX. 

As a consequence of the previous proposition, since (2.5) yields |V + c/?| G L 2 (X,fj,), the 
lower bound on \i yields |V<^| W G L 2 (X, m). 

2.4.3 Convex functionals: gradient flows, entropy, and the CD(K, 00) condition 

Let (Y, dy) be a complete and separable metric space, E : Y — > R U {+00}, and A' G R. 
We say that £/ is A-geodesically convex if for any yo, y\ G D(E) there exists 7 G Geo(y) 
satisfying 70 = y , 71 = yi and 

A( 7 i) < (1 - t)E(y )+tE( yi ) - ^t(l - t)d 2 Y (y , yi ) for every t G [0,1]. 



Notice that if E is A-geodesically convex, then D(E) is geodesic in Y and therefore D{E) is 
a length space. 

A consequence of A-geodesic convexity is that the descending slope |V~A| can be calcu- 
lated at all y G D(E) as 

w-E\( V )= SUP (m^m + !i My , z) Y. (2 . 24) 

We recall (see [3, Corollary 2.4.10]) that for A-geodesically convex and l.s.c. functionals the 
descending slope is an upper gradient, in particular the property we shall need is 



E{y s )<E(y t ) + f \y r \\V A|(y r )dr for every s, t G [0, 00), s<t, 

J s 



(2.25) 



for all locally absolutely continuous curves y : [0, 00) — > D(E). A metric gradient flow for the 
A-geodesically convex functional A is a locally absolutely continuous curve y : [0, 00) —> D(E) 
along which (2.25) holds as an equality and moreover \yt\ = \V~E\(yt) for a.e. t > 0, so that 
the energy dissipation rate ^E(y t ) is equal to — \yt\ 2 = — |V~A| 2 (y t ) for a.e. t > 0. 

An application of Young inequality shows that metric gradient flows for A-geodesically 
convex and l.s.c. functionals can equivalently be defined as follows. 
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Definition 2.14 (Metric formulation of gradient flow) Let E : Y — > R U {+00} be a 
K-geodesically convex and l.s.c. functional. We say that a locally absolutely continuous curve 
[0,oo) 3 1 1—)- y t £ D(E) is a gradient flow of E starting from y$ £ D(E) if 

E(y ) = E(y t ) + J^\y r \ 2 + ^\V~E\ 2 (y r )dr Vt > 0. (2.26) 

We now recall the definition of metric measure space with Ricci curvature bounded from 
below by K E R, following [39, §4.2] and [27, §5]. More precisely, we consider here the weaker 
definition of [39] and we will discuss a stronger version in Section 3: see the bibliographical 
references of [43, Chapter 17] for a comparison between the two approaches. 

Definition 2.15 (CD(K, 00) spaces) We say that (X, d,m) E X has Ricci curvature bounded 
from below by K £ R (in short: it is a CD(K, 00) space) if the relative entropy func- 
tional Ent m is K-geodesically convex on (£p2(X), W2), i.e. for any pair of measures fi, v £ 
D(Ent m ) n &2{X) there exists a constant speed geodesic (fit) £ &2(X) such that fiQ = \x, 
fi\ = v and 

K 

Ent m (^ t ) < (1 - t)Ent m (/i ) + iEnt w (/ii) - —t(l - t)W^(fi , Ml) for every t £ [0, 1]. 

Notice that, in comparison with the definition given in [39] and [27] we are restricting the 
analysis to the case of a probability reference measure m with finite second moment (but we 
do not assume local compactness). This is actually unneeded from the "Ricci bound" point 
of view (see also [2, Definition 9.1]), however in this paper we want to focus more on the 
geometrical aspect, rather than on the - non trivial - analytic tools needed to work in higher 
generality: the assumption m £ &2(X) serves to this scope. 

Let us also remark that a CD(K, 00) space (X, d, m) satisfies the length property, i.e. suppm 
is a length space if it is endowed with the distance d [39, Remark 4.6(iii)] (the proof therein, 
based on an approximate midpoint construction, does not use the local compactness). 

Now let (X, d,m) be a CD(K, 00) space. Then, by assumption, the relative entropy 
functional Ent m is -KT-geodesically convex on (& > %{X)^Wi\ so that we could ask about the 
existence and the uniqueness of its gradient flow. The following theorem, proved in [18] for 
the locally compact case and generalized in [2, Theorem 9.3(h)] holds: 

Theorem 2.16 (Gradient flow of the relative entropy) Let (X, d,m) be a CD(K, 00) 
space. Then for any fx £ D(Ent m ) n &2{X) there exists a unique gradient flow of Ent m 
starting from ft. 

Notice that the theorem says nothing about contractivity of the Wasserstein distance 
along the flow, a property which we address in Section 2.5. Actually, Ohta and Sturm proved 
in [41] that contractivity fails if (X, d, m) is R rf endowed with the Lebesgue measure and with 
a distance coming from a norm not induced by a scalar product. 

2.4.4 The heat flow as gradient flow in L 2 (X, m) and in ^(X) 

One of the main result of [2] has been the following identification theorem, see formula (8.5) 
and Theorem 9.3(iii) therein. 
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Theorem 2.17 (The heat flow as gradient flow) Let (X, d,m) be aCD(K,oo) space and 
let f £ L 2 (X,m) be such that p = fm £ ^(X). Let (f t ) be the gradient flow of Ch in 
L 2 (X, m) starting from f as in Theorem 2.12, and let (fit) be the gradient flow of Ent m in 
&2{X) starting from p, as in Theorem 2.16. 

Then p t = f t m for allt>0, 1 t— > Fiiit m (p t ) is locally absolutely continuous in [0,oo), and 



,,.i2_ y iv/, 



|2 



, Ent m (//t) = = / ^^dm /or a.e. t > 0. (2.27) 
dt ^{/t>0} /t 

In other words, we can unambiguously define the heat flow on a CD(K, oo) space either 
as the gradient flow of Cheeger's energy in L 2 (X, m) or as the gradient flow of the relative 
entropy in (^2(X), W2). A byproduct of this proof is also (see [2, Theorem 9.3(i)]) the 
equality between slope and the so-called Fisher information functional: 

|V-Ent ro | 2 (pm) = 4 J iV^dm (2.28) 

for all probability densities p such that y/p £ D(C\r\). Choosing / = ^fp this identity, in 
conjunction with the HWL inequality relating entropy, Wasserstein distance and Fisher in- 
formation (see [26] or [1, Proposition 7.18]) gives the log-Sobolev inequality 

j f log f dm < — J \Vf\ 2 w dm whenever / £ D(Ch) and J fdm= 1. (2.29) 

We will denote by H t : L 2 (X, m) — > L 2 (X,m) the heat semigroup in L 2 (X, m) and by 
rJ%t '■ &2(X) — > &2(X) the gradient flow of the entropy on £p2{X). A distinct notation is 
useful not only for conceptual reasons, but also because the domains of the two gradient flows 
don't match, even if we identify absolutely continuous measures with their densities. 

Some basic properties of the heat flow that we will need later on are collected in the 
following proposition, see [2, Theorem 4.16] also for further details. 

Proposition 2.18 (Some properties of the heat flow) Let (X,d,m) £ X and f £ L 2 (X,m). 
Then the following statements hold: 

(i) (Maximum principle) If f < C (resp. f > C) m-a.e. in X for some C £ M, then 
H t (/) < C (resp. H t (/) > C) m-a.e. in X for any t > 0. 

(ii) (1 -homogeneity) Hj(A/) = AH^(/) for any A £ IR, t > 0. 
2.5 EVI formulation of gradient flows 

Here we recall a stronger formulation of gradient flows in a complete and separable metric 
space (Y, dy), introduced and extensively studied in [3], [13], [35], which will play a key role 
in our analysis. 

Definition 2.19 (Gradient flows in the EVI sense) Let E : Y — > R U {+00} be a lower 
semicontinuous functional, K £ K and (0, 00) 3 t i->- y% £ D(E) be a locally absolutely 
continuous curve. We say that (yt) is a K -gradient flow for E in the Evolution Variational 
Inequalities sense (or, simply, it is an EVI^ gradient flow) if for any z £ Y it holds 

^4^) + K_^ {yu z) + E{yt) < E{z) for a e t G (0) ^ {2 30) 



Iflimyt = yo £ D(E), we say that the gradient flow starts from y$. 
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Notice that the derivative in (2.30) exists for a.e. t > 0, since 1 1— >■ dy (yt, z) is locally absolutely 
continuous in (0,oo). 

In the next proposition we will consider equivalent formulations of (2.30) involving subsets 
D C D(E) dense in energy: it means that for any y G D(E) there exists a sequence (y n ) C D 
such that dy (y n , y) — > and E{y n ) — > E(y) as n — > oo. 

Proposition 2.20 (Equivalent formulations of EVI) Let E, K be as in Definition 2.19, 
D C D(E) dense in energy, and y : (0, oo) — > D(E) be a locally absolutely continuous curve 
with lim^o Vt = Vo £ D(E). Then, (yt) is an EVI^ gradient flow if and only if one of the 
following properties is satisfied: 

(i) (Dense version) The differential inequality (2.30) holds for all z £ D. 

(ii) (Integral version) For all z 6 D it holds 



Uv 

2 



(y u z) - Y Y Z) <l K (t- s) (e(z) - E(y t j) for every 0<s<t, (2.31) 



where iR-(i) := J * eKr dr. 
(Hi) (Pointwise version) For all z £ D it holds 

r dyiyt+hiz) - d^(y t ,z) K 2 a i t?( \ ^ t?( \ t , xn 

hmsup-!^ 1 h— o Y {yt, z)+E(yt) < E(z) for every t > 0. (2.32) 

hiO 2 2 

Proof. To get (2.31) for all z 6 D(E) from (2.30), just multiply by e Kt and integrate in 
time, using the fact that t \— > E(yt) is nonincreasing (see e.g. [10] and the next Proposition); a 
differentiation provides the equivalence, since y is absolutely continuous. The fact that (2.31) 
holds for any z if and only if it holds in a set dense in energy is trivial, so that the equivalence 
of (ii) and Definition 2.19 is proved. The equivalences with (i) and (Hi) follow by similar 
arguments. □ 

We recall some basic and useful properties of gradient flows in the EVI sense; we give 
here the essential sketch of the proofs, referring to [3, Chap. 4] and [35] for more details and 
results. In particular, we emphasize that the maps S( : yo i— )• yt that at every yo associate the 
value at time t > of the unique K-gradient flow starting from yo gi ye raise to a continuous 
semigroup of K-contractions according to (2.33) in a closed (possibly empty) subset of Y. 

Proposition 2.21 (Properties of gradient flows in the EVI sense) LetY, E, K, y t be 
as in Definition 2.19 and suppose that (yt) is an EVIa' gradient flow of E starting from yo- 
Then: 

(i) If yo G D(E), then yt is also a metric gradient flow, i.e. (2.26) holds. 

(ii) If (yt) is another EVI^ gradient flow for E starting from yo, it holds 

MvuVt) < e- X *dy(y ,y ). (2.33) 

In particular, EVIx gradient flows uniquely depend on the initial condition. 

(Hi) Existence of EVI k gradient flows starting from any point in D <ZY implies existence 
starting from any point in D. 
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(iv) (yt) is locally Lipschitz in (0, oo), yt £ D(\V E\) for every t > 0, the map t t- > 
e Kt \\7~E\(yt) is nonincreasing, and we have the regularization estimate 

I K (t)E(y t )+ ( lK ^ \V-E\ 2 (y t ) <l K (t) E(z)+^d 2 y(z,y ) Vt > 0, z G D(E). (2.34) 

Proof. The fact that EVI^ gradient flows satisfy (2.26) has been proved by the third author 
in [35] (see also [1, Proposition 3.9]). The contractivity property (ii) has been proved in [3, 
Chap. 4]. Statement (Hi) follows trivially from contractivity and integral formulation (2.31) 
of the EVI. The fact that t ^ e Kt \V~ E\(y t ) is nonincreasing follows from the energy identity, 
which shows that |V~ E\(yt) = \yt\, and the .fT-contraction estimate (2.33), which in particular 
yields that 1 1-)- e Kt dY(yt,Vt+h) is nonincreasing as well as t \- > e Kt \i/t\. 

An easier regularization formula for t \— > E(yt) follows immediately by (2.31) by choosing 
s = and neglecting the term proportional to dy (yt, z). Inequality (2.34) is a consequence of 
the EVIft-, the identity ^E(y t ) = — \V~E\ 2 (yt), the previous monotonicity property and the 
following calculations: 

l -(l K (i)f\V-E\ 2 {y t ) = l -(l_ K {t)fe 2Kt \V~E\ 2 {y t ) < j\„ K (s)e- Ks e 2Ks \V- E\ 2 (y s ) ds 

= - I' l. K (s)e Ks {E(y s ) - E(y t ))' ds = f e Ks (E(y s ) - E(y t )) ds 
Jo Jo 

< J* -±(e K 'd$(y„ z))' + e Ks (E(z) - E(y t )) ds < ±d 2 Y (y , z) + l K {t){E{z) - E(y t )). 

□ 

We point out that in general existence of EVIx gradient flows is a consequence of the K- 
geodesic convexity of E and of strong geometric assumptions on the metric space (Y, dy): it is 
well known when Y is a convex set of an Hilbert space, but existence holds even when (Y, dy) 
satisfies suitable lower sectional curvature bounds in the sense of Alexandrov [20, 29, 34, 35], 
or when suitable compatibility conditions between E and d hold [3, Chapter 4] which include 
also spaces with nonpositive Alexandrov curvature. In the present paper we will study an 
important situation where EVI^ gradient flows arise without any assumption on sectional 
curvature. 

In any case, EVI^ gradient flows have the following interesting geometric consequence on 
the functional E [13, Theorem 3.2]: if EVI^ gradient flows exist for any initial data, then 
the functional is .fT-convex along any geodesic contained in D(E). Recall that the standard 
definition of geodesic convexity, e.g. the one involved in Definition 2.15 of CD(K, oo) metric 
measure spaces, requires convexity along some geodesic; this choice is usually motivated 
by stability properties w.r.t. T-convergence [3, Thm. 9.1.4] and Sturm-Gromov-Hausdorff 
convergence in the case of metric measure spaces (see also the next section). We state this 
property in a quantitative way, which will turn out to be useful in the following. 

Proposition 2.22 Let E, K , yt be as in Definition 2.19 and assume that for every yo £ D(E) 
there exists the EVIif gradient flow yt '■= St(yo) for E starting from yo- If e > and 
7 : [0, 1] —7- D(E) is a Lipschitz curve satisfying 

dr(7si)7s 2 ) < L\si - s 2 \, L 2 < d y (7 , 71) + e 2 for every s\, s 2 e [0,1], (2.35) 
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then for every t > and s G [0, 1] 

# (S t ( 7s )) < (1 - «)#(lto) + sE( yi ) - y s (l - s)d y (y , l/i) + ^|^y s (l " «)• (2-36) 

In particular E is K -convex along all geodesies contained in D(E). 

The last statement is an immediate consequence of (2.36) by choosing e = and letting 

1 4, o. 

3 Strong CD(K, oo) spaces 

Definition 3.1 (Strong CD(K, oo) spaces) M^e say £/iaf (X, d,m) zs a strong CD(K, oo) 
space if for every /j,q, G £)(Ent m ) n .^(X) ^ere exists an optimal geodesic plan 7r /rom /^o 
to /ii such that K -convexity of the entropy holds along all weighted plans tvf := Fir, where 
F : Geo(X) — > R is any Borel, bounded, non negative function such that J F dir = 1. More 
precisely, for any such F, the interpolated measures p,F,t '■= ( e t)^F satisfy: 

Ent w (^,i) < (1 - t)Ent m (nF,o) + tEnt m (n F ,i) - y*(l - t)Wf (Mf,o, A*f,i) Vt G [0, 1]. 

It is unclear to us whether this notion is stable w.r.t D-convergence or not. As such, 
it should be handled with care. We introduced this definition for two reasons. The first 
one is that applying Proposition 2.22 we will show in Lemma 5.2 that if a length metric 
measure space (X, d,m) admits existence of EVI^- gradient flows of Ent m for any initial 
measure p, G Z?(Ent m ) n &2{X), then it is a strong CD(K, oo) space. Given that spaces 
admitting EVI^ gradient flows for Ent m are the main subject of investigation of this paper, 
it is interesting to study a priori the properties of strong CD(K, oo) spaces. The other reason 
is due to the fact that the proof that linearity of the heat flow implies the existence of EVI^ 
gradient flows of the entropy requires additional L°°-estimates for displacement interpolations 
which looks unavailable in general CD(K, oo) spaces. 

Remark 3.2 (The nonbranching case) If a space (X, d,m) is CD(K, oo) and nonbranch- 
ing, then it is also strong CD(K, oo) according to the previous definition. 
Indeed, pick po, p\ G D(Ent w ), let 7r G GeoOpt(//o, A*i) be such that the relative entropy 
is if-convex along ((et)j7r), so that Ent m ((et)j7r) is bounded in [0,1]. Now, pick F as in 
Definition 3.1, let pp,t '■= {^t)^F and notice that the real function s i— > <p(s) := Ent m (^p iS ) is 
bounded (thus in particular pp,s G _D(Ent m )) in [0, 1], since pF,t < sup |F|(et)p7r. 
The nonbranching assumption ensures that for any t G (0,1) there is a unique geodesic con- 
necting pp,t to fiF,o (and similarly to pf,i)- Hence, since (X, d,m) is a CD(K, oo) space, the 
restriction of (j) to all the intervals of the form [0, t] and [t, 1] for t G (0,1) is fT-convex and 
finite. It follows that (j) is /^-convex in [0, 1]. ■ 

In order to better understand the next basic interpolation estimate, let us consider the 
simpler case of an otimal geodesic plan 7r G GeoOpt( / uo, p>i) in a nonbranching CD(K, oo) 
space (X, d,m). Assuming that p\ = pim G D(Ent m ) and setting p t := p t m, along 7r-a.e. 
geodesic 7 the real map t i-> logp<(7<) is K-convex and therefore pt(~ft) can be pointwise 
estimated by [43, Thm. 30.32, (30.51)] 

Pt(lt) < e-^* (1 -* )d2(7o ' 7l) Po(7o) 1 ~Vi(7i)* for every t G [0, 1], for vr-a.e. 7. (3.1) 
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Inequality (3.1) for smooth Riemannian manifolds goes back to [11]. If pi have bounded 
supports, one immediately gets the uniform L°°-bound: 



llftlloo < e* * (1 t)s2 ||pollL* IIPillL. with S 1 := sup {d(xo,X!) : Xi G supp(^)}. (3.2) 

When > (3.2) is also a consequence of the definition (stronger than (2.15)) of spaces with 
non- negative Ricci curvature given by [27], which in particular yields the geodesic convexity 
of all the functionals U p {p) := J p v dm whenever p = pm and p > 1. 

If we know that only p\ is supported in a bounded set, we can still get a weighted L°°- 
bound on p^. Let us assume that 

supppi C C, with Di := diam(C) < oo, D(x) := dist(x, C) for x £ X, (3.3) 

and let us observe that for 7r-a.e. 7 we have 71 G supp/ii, so that for every t G [0, 1) it holds 

j/ x d( 7 j,7i) D(7t) + Di ,„ ,x 

d(7o,7i)= j— ^ < — 7—^ — . (3-4) 

1 — 2t 

D( 7 o) > d(7 t ,7i) - Di -d( 7o , 7t ) = (1 - 2t)d( 7o ,7i) - Di > -y— £-D( 7 t) ~ DL ( 3 - 5 ) 



Substituting the above bounds in (3.1) we get 

il-t 

\L°°(R(D(x),t);m.) 



Pt(x) < e^ra(D(-)+D i ) 2 ||^ o ||i~t ii^n^ m . a _ e . in X; (3 _ 6) 



where 

t) := {y G X : D(y) > £> - Di}, D > 0, t G [0, 1). (3.7) 

The next lemma shows that the strong CD(K, 00) condition is sufficient to obtain the 
same estimates. 

Proposition 3.3 (Interpolation properties) Let (X, d,m) be a strong CD(K, 00) space 
and let po, p\ be 'probability densities such that pi = pirn G D(Ent m ) n Assume 
that pi is bounded and with support contained in a bounded set C as in (3.3) and let ir G 
GeoOpt(/xo, p-i) as in Definition 3.1. Then for all t G [0,1) the density p t of p t = (e t ){j-7r 
satisfies (3.6). Furthermore, if also po is bounded with bounded support, then (3.2) holds and 
sn Pt \\Pt\\°o < 00. 

Proof. Let 7r be given by the strong CD(K, 00) condition. Fix t G (0, 1) and assume that 
(3.6) does not hold on a Borel set B of positive m-measure. Then we can find a Borel set 
A C B with m(A) > such that 

Pt(x) > e^m^+^V^IIp!^ Vx G A, 

where 

M : = l|po|U°°(ii(D 2 ,t);m)> D i : = supD(x), D 2 := inf D(x). (3.8) 

To build A, it suffices to slice B in countably many pieces where the oscillation of D is 
sufficiently small. We have 7r((et)~ 1 (A)) = pt(A) > 0, thus the plan 7r := c7rLe < T 1 (^4), 
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where c := 1 is the normalizing constant, is well defined. Let p s be the density of 

£k s = (e s )jj7r. By definition it holds pt = cpt on A and pt = on X \ A, thus we have: 

f K~ t 2 

Enk m (fi t )= / p t log p t dm >logc+— — -(Di + Di) + (1 - t) log M + ilog ||pi|U- (3.9) 

On the other hand, we have po < c Po an d pi < cpi hence 

Ent m (/2 ) = J log(p oe )d7r < log c + log (||po ° eo|U°°(Geo(x),#))i ( 3 - 10 ) 

Ent m (Ai) = J log^i oei)dfr < logc + logHpiHoo. (3.11) 

Now observe that 7r-a.e. geodesic 7 satisfies jt G A and 71 G supppi C C, so that (3.4) and 
(3.5) yield 

Di + Di ^. . 1 - 2^ ^. , 
d(7o,7i)< , D( 7o ) > ^— -D 2 -Di, i.e. 7 oG J R(D 2 ,t). 

Integrating the squared first inequality w.r.t. tt and combining the second one, (3.10), and 
(3.8) we get 



\_ t ) > Ent m (Ao) < logc+(l-t)logM. (3.12) 

Inequalities (3.9), (3.11), and (3.12) contradict the if -convexity of the entropy along ((e s )tj7r), 
so the proof of the first claim is concluded. 

The proof of (3.2) when also po has bounded support follows the same lines just used. 
Let t G (0, 1) and assume that (3.2) does not hold. Thus there exists a Borel set A of 
positive m-measure such that pt > e K t ( 1 ~ t ) s ^Hpoll;^* llPilloo m A. As before, we define 
7T := c7rLe < T 1 (^4), where c is the normalizing constant, and p s as the density of (e s )(j7r: the 
inequalities 

K~ 

Ent m ((e t ) fl 7f) > logc + — t(l - t)S 2 + (1 - 1) log ||p ||oo + * log \\pi\\oo, 
Ent m ((e )tj7r) < log c + log || 

PO 1 1 OO 5 Ent m ((ei) tt 7r) <logc + log|| 

Pi || 00 5 

W 2 2 ((eo) tt 7r,(e 1 ) tt 7r) <S 2 , 

contradict the if-convexity of the entropy along ((e s )jj7r). □ 

In the sequel we will occasionally use the stretching/restriction operator restrg in C([0, 1]; X), 
defined for all s G [0, 1] by 

restrg( 7 ) t := Jts t € [0,1]. 

Proposition 3.4 (Existence of test plans) Let (X, d,m) be a strong CD (K, 00) space and 
let po, pi be probability densities. Assume that p\ is bounded with bounded support as in (3.3), 
that po is bounded and satisfies 

po(x) < ce -9 K -( D ( x )-C) 2 whenever D(x) := dist(x, supp pi) > R, (3.13) 

for some nonnegative constants c, C, R. Then, for it G GeoOpt(pom, pim) as in Defini- 
1/3 

tion 3.1, (restr )^7T is a test plan (recall Definition 2.9). 
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Proof. In order to avoid cumbersome formulas, in this proof we switch to the exp notation. 
We need to prove that sup x pt is uniformly bounded in [0, 1/3]. Let Di = diam(supp pi), M 
the function defined in (3.7), L a constant to be specified later, A := {y : D(y) < L} and set 
7T 1 := 7rLeg 1 (t4), it 2 := 7v\_e^ 1 (X \ A). Choosing L large enough we have a := 7r(eQ 1 (^4)) > 
and we can also assume that a < 1 (otherwise, po has bounded support and the second 
part of Proposition 3.3 applies). Also, possibly increasing R and taking (3.13) into account, 
we can assume that po(x) < 1 wherever D(x) > R. 

Denoting by 7r , 7r 2 the corresponding renormalized plans, it suffices to show that both 
have bounded densities in the time interval [0,1/3], because 7r is a convex combination of 
them. Concerning # , notice that both (eo^Tr 1 and (ei)jjTr 1 have bounded support and 
bounded density, so that the conclusion follows from the second part of Proposition 3.3. 

For 7r 2 we argue as follows. Pick 7 G supp fr 2 and notice that 71 G supp p\ and t < ^ give 
the inequality 

D(7t) > d(7t,7i) - Di = (1 - t)d( 7 o,7i) - Di > (1 - t)D( 7o ) - Di > -£>( 7 o) - Di. 
So, choosing L sufficiently large (depending only on Di and R), we have 

7oSl\i - Di > R. 

Recalling the definition (3.7) of R(D,t) and using the fact that t G [0, 1/3], we get that 

y G R(D(jt), t) D(y) > i^D( 7t ) - Di > _ Di > R for all 7 G supp tt 2 , 

and therefore (3.13) gives 

sup po < cexp( — 9iT~(D(7 t ) — Di — C) J for all 7 G suppfr 2 . 

«(D(T*),t) V 7 

Now, by applying (3.6) to 7r 2 , we get that the density t/ 4 of (et)^7r 2 satisfies 

^t) < ^;exp(— (D( 7t ) + Di) 2 )||p |lI^ (i?(D(70ii)im) ||pi||'oo for fr 2 -a.e. 7. 
Using the fact that t varies in [0, 1/3] and po < 1 in i?(D(7 t ),i), we eventually get 

1 /3 

milt) < C ^^exp(^(D(7 t ) + Di) 2 - 6if~(^M _ Di _ c)') for 7r 2 -a.e. 7. 

Since — |X~D 2 (7() is the leading term in the exponential, the right-hand side is bounded and 
we deduce that ||r/t||oo = \\f]t e t lli°°(Geo(x) tt 2 ) 15 uniformly bounded. □ 

Proposition 3.5 (Metric Brenier theorem for strong CD (K, 00) spaces) Let(X, d,m) G 
X be a strong CD(K, 00) space, x G X, p = p m G with 

< cr < po < c^ 1 m-a.e. m Br{xq) for every R > 0, (3-14) 
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and p\ £ &2{X) with bounded support and bounded density. Then, for iv £ GeoOpt(/io, A*i) 
as in Definition 3.1, there exists L £ 1?{X, po) such that 

L(7o) = d (70,71) for -rr -a.e. 7 £ Geo(X). 

Furthermore, 

L(x) = \Vtp\ w (x) = |V + (/?|(x) for fiQ-a.e. x £ X, 
where <p is any Kantorovich potential relative to (po,p\). 

Proof. We apply the metric Brenier Theorem 10.3 of [2] with V{x) = A{x,xq). To this aim, 
we need only to show that 

(et) tt 7r(B n B R (x Q )) < C(R)m(B) for every t £ [0, 1/2], B £ 3§{X), R>0. (3.15) 

Denoting by R\ the radius of a ball containing the support of fii, notice that if a curve 7 in 
the support of 7r hits Br(xo) at some time s £ [0, 1/2], then 

d(7o,7i) < 2d( 7s , 7 i) < 2(R + R 1 ) 

because 71 £ Bji 1 (xo). Possibly restricting 7r to the set of 7's hitting Br(xo) at some s £ 
[0,1/2], an operation which does not affect ((et)jj7r) \_Br(xq) for t £ [0,1/2], we get that 
(eo)(j7r, (ei)jj7r have bounded support and bounded densities, thus the conclusion follows from 
the second part of Proposition 3.3. □ 



4 Key formulas 

4.1 Derivative of the squared Wasserstein distance 

In this short section we compute the derivative of the squared Wasserstein distance along a 
heat flow. 

Theorem 4.1 (Derivative of squared Wasserstein distance) Let(X, d,m) be a CD(K, 00) 
space, h = pm £ &2(X) such that 0<c<p<C<oo and define pt "■= ^?(a*) = Pttn. Let 
a £ &2{X) and for any t > let ipt be a Kantorovich potential relative to (/i^cr). Then for 
a.e. t > it holds 

< chte-^)-ch(p,) V£ > Q (41) 

Proof. Since 1 1— > ptm is a locally absolutely continuous curve in &2(X), the derivative at the 
left hand side of (4.1) exists for a.e. t > 0. Also, the derivative of 1 1— > pt = H^(p) £ L 2 (X, m) 
exists for a.e. t > 0. Fix to > where both derivatives exist and notice that since <pt is a 
Kantorovich potential for (pt , cr) it holds 

^W£(fM to ,o-) = J (pto dp t0 + J (p c t0 da 

T^Hnto-h, cr) > y <p to d/ito-^ + y ^ Q do- for all h such that t - h > 0. 
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Taking the difference between the first identity and the second inequality, dividing by h > 0, 
and letting h — > we obtain 



d_l 

dt2 



< liminf 



X 



¥*0 



Ptp — Pt -h 

h 



dm. 



Now recall that (ft G L X (X, pt Q ), so that by our assumption on p and the maximum principle 
(Proposition 2.18) we deduce that ip ta G L 1 (X, m). By Proposition 2.13 we have IV^Jw G 
L 2 {X,m). 

Now, if ipt G L 2 (X, m) the estimate of the liminf with the difference quotient of Ch is 
just a consequence of the following three facts: the first one is that, for all t > we have 
/i _1 ( / o f+ / l — pt) Ap t as h I in L 2 (X, m); the second one is that we have chosen to such 
that the full limit exists; the third one is the inequality 

Ch(p to ) + e j <pt Ap to dm < Ch(p to - eip to ) Ve > 

provided by the inclusion — Ap to G d~Ch(p to ). 

For the general case, fix to > as before, e > and let (p N := max{min{y? to , N}, —N} G 
L 2 (X,m) be the truncated functions. Since the chain rule (2.20) gives (V^l^ < |V^ |iu, the 
locality of the minimal weak gradient (2.18) and the dominated convergence theorem ensures 
that Ch(pt — E(p N ) — > Ch(pt — etpt ) as N — > oo. Applying Lemma 4.2 below with / := ipt — <p N 
we get 



sup 

/iG(0,< /2) 



/ N\ Pto ~ PtQ-h j 

((pto -if JV ) r dm 



to 



< sup — 



he(0,t /2) 



h 



\V<Pt \l,P8 dm 



to-h {\ft \>N} 



^dm)d,s, 

Ps 



and hence 



lim sup sup 

N^oo he(0,t /2) 



VPt -V ) r dm 



0. 



which is sufficient to conclude, applying the liminf estimate to all functions ip N and then 
passing to the limit. □ 



Lemma 4.2 With the same notation and assumptions of the previous theorem, for every 
f G L X {X, m) and [s,t] C (0, oo) it holds 



/^idm 
t — s 



< 



1 



t - s 



iV/l^dm 



|Vp s 



dm dr. 



Proof. Assume first that / G L 2 (X,m). Then from (2.23) we get 



fAp r dm 



< (J |V/UVp r | to dtn) < J |V/|* Pr dm y^^dm, 



(4.2) 



for all r > 0, and the thesis follows by integration in (s,t). 
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For the general case, let / := max{min{/, N}, — N} G L 2 (X,m) be the truncated func- 
tions. By Proposition 2.18(i) we know that pt — p s G L°°(X,m), so that 

lint / f N ^^dm= [ f P -^dm, 

TV-i-oo J t — S J t — S 

by dominated convergence. Also, by the chain rule (2.20) we have \Vf N \ w < |V/|u, m-a.e. in 
X. The conclusion follows. □ 



4.2 Derivative of the entropy along a geodesic 

We now look for a formula to bound from below the derivative of the entropy along a geodesic, 
which is going to be a much harder task compared to Theorem 4.1, due to the lack of a change 
of variable formula. From the technical point of view, we will need to assume that (X, d,m) 
is a strong CD(K, oo) space, it order to apply the metric Brenier theorem 3.5. From the 
geometric point of view, the key property that we will use is given by Lemma 4.5 where 
we relate "horizontal" to "vertical" derivatives. In order to better understand the point, we 
propose the following simple example. 

Example 4.3 Let || • || be a smooth, strictly convex norm on and let || • ||* be the dual 
norm. Let L be the duality map from (R rf , || • ||) to || • ||*) and let L* be its inverse 



(respectively, the differentials of the maps ^|| ■ || 2 and ||| • || 2 ). For a smooth map / : M. d — > R 
its differential Df{x) at any point x is intrinsically defined as cotangent vector. To define 
the gradient Vg(x) of a function g : M. d — > K (which is a tangent vector), the norm comes 
into play via the formula Vg(x) := L*(Dg(x)). Notice that the gradient can be characterized 
without invoking the duality map: first of all one evaluates the slope 

\Vg\(x) :=hmsup = \\Dg(x)\U; (4.3) 

y->x lF-y|| 



7o = x, — g(jt)\ t=0 = IItoII 2 = |V#| 2 (x). 



then one looks for smooth curves 7 : (-5,6) -> R d such that 

d_ 

^w./|t=o 11 -11 1 --1 v-/- ( 4-4 j 
In this case Vg(x) = 70 and |V<?|(:e) = ||V^(x)||. 

Now, given two smooth functions /, g, the real number Df(Vg)(x) is well defined as the 
application of the cotangent vector Df(x) to the tangent vector V <?(#). 

What we want to point out, is that there are in principle and in practice two very different 
ways of obtaining Df(Vg)(x) from a derivation. The first one, maybe more conventional, is 
the "horizontal derivative": 

Df(Vg)(x) = Df(jo) = lim ^ f_l®_ where 7 is any curve as in (4.4). 

t — t 

The second one is the "vertical derivative" , where we consider perturbations of the slope 



Df(Vg)(x) = lim 



^l|V(<7 + £ /)|| 2 (x)-i||V 5 || 2 (x) 



It coincides with the previous quantity thanks to the "dual" representation (4.3). 
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We emphasize that this relation between horizontal and vertical derivation holds in a 
purely metric setting: compare the statement of the example with that of Lemma 4.5 below 
(the plan ir playing the role of a curve 7 as in (4.4), moving points in the direction of — Vy). 

For 7 G AC 2 ([0, 1];X) we set 



Etil) :-- 



't 



l7s| 2 ds. 



(4.5) 



Notice that £^(7) reduces to d(7o,7t) if 7 € Geo(X). In the sequel it is tacitly understood 
that the undetermined ratios of the form 

filt) ~ /(70) 
Etil) 

are set equal to whenever £^(7) = 0, i.e. 7 is constant in [Q,t]. 

Recall that he notion of negligible collections of curves in AC 2 ([0,1];X) has been intro- 
duced in Definition 2.9. 

Lemma 4.4 Let f : X — )• R be a Borel function, Sobolev on almost every curve, such that 
|V/|iu G L?(X, m), and let 7r be a test plan. Then 

2 



lim sup 

40 



Kit) - /(70) 



d7r( 7 )< / |V/| 2 y (7o)d7r( 7 ). 



(4.6) 



In particular, assume that it G GeoOpt(/x, v) with fj,, v <C ttl with bounded densities, v with 
bounded support, fj, > cm for some c > and let (p be a Kantorovich potential relative to it. 
Then it holds 



lim 

40 



|V^| w (7o) in L z (Geo{X),7v). 



Proof. For any t G (0, 1) and 7r-a.e. 7 it holds 



f(lt) ~ /(7o) 



Et(l) 



< 



fo\Vf\ w (ls)\is\ds 



< 



\Vf\l(ls)ds. 



(4.7) 



(4- 



Hence 



/ 



filt) - /(70) 



d7r( 7 ) < - 



|V/| 2 ( 7s )d S d7r( 7 ) 



p s ds |V/|2 dm, 



where p s is the density of (e s )jj7T. Now notice that p t m — > pom as t 1 in duality with 
continuous and bounded functions and that sup t ||pt|| 00 < 00. Hence pt — > po weakly* in 
L°°(X, m) and the conclusion follows from the fact that |V/| 2 G L 1 (X, m). 

The second part of the statement follows by [2, Theorem 10.3], Proposition 3.5, and the 
identity E t (-y) = d(7o,7t) since 7r in this case is concentrated on Geo(X). □ 



Lemma 4.5 (Horizontal and vertical derivatives) Let f,g:X^-M,be Borel functions, 
Sobolev on almost every curve, such that both |V/|k, and \ Vg\ w belong to L 2 (X,m), and let 
it be a test plan. Assume that 



lim 

40 



g(7o) ~ g(lt) 
Et{l) 



Et{l) 



lim 

40 t 



\Vg\M znL 2 (AC 2 ([0,l];X),7r) 



(4.9) 
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Then 

hMfW-"*»J.[,| > f „ up f«!?qb±*W d „ W , (4.10) 
Proof. Define functions F u G t : AC 2 ([0, 1];X) -»■ R by 

P / \ /(to) - f(rft) n , \ g(7o) - g(7t) 

Fi(7) •= ^(7) ' G * (7) = ^(7) ' 

By (4.9) we get 

lim j G?d7r = j |V 5 |2( 70 )d7r( 7 ). (4.11) 



Applying Lemma 4.4 to the function g + ef we obtain 
|V(5 + e/)l^(7o)d7r( 7 ) > limsup 



40 

> 



(g + £f)(io)-(g + ef)(j t ) 2 



Mi) 

limsup I (G? ( 7 ) + 2eG t F t ) dir( 7 ). 

40 J 



d7r(7) 



(4.12) 



40 

Subtracting this inequality from (4.11) we get 

1 1 gcM^ifaiM d7r(7) , m ^ f _ j Gtb)Ftb) Mi) 

By assumption, we know that \\Gt — Et/tW? as i 0. Also, by Lemma 4.4, we have 
sup t ||-Pf||2 < 00 • Thus it holds 

liminf - j G t {i)F t {i) d?r( 7 ) = liminf - J ^^F t ( 7 ) d7r( 7 ) = liminf j ~ ^ 7o ^ d7r( 7 ). 

□ 

Before turning to the proof of the estimate of the derivative of the entropy along a geodesic, 
we need two more lemmas. 

Lemma 4.6 Let (X, d, m) be a metric measure space, it a test plan and Xn '■ X — > [0, 1] mono- 
tonically convergent to 1. Define the plans 7r n := c n (x n ° eo)7r, where c n is the normalizing 
constant. Then 

lim Ent m ((e t ) tt 7r n ) = Ent m ((e t ) S 7r) Vt € [0, 1]. 

n— s-oo 

Proof. If p n> t are the densities w.r.t. m of (et)ft(xn eo7r n ), by monotone convergence we 
have J p ny t log p n j dm — > Ent m ((et)j7r). Since c n 1 1 the thesis follows. □ 

Lemma 4.7 Lei /, g : X — >• R be Sobolev functions along a.e. curve, let J be an interval 
containing g(X) and let <f> : J — > R 6e nondecreasing, Lipschitz and C . Then 

lim ga±#M)&^a = , Um iv(/ +ei »a-iv/ft m _ „ x 

e^o e £j,o e 

Similarly, under the same assumptions on (j), if Ch is a quadratic form and |V/| W € L 2 (X,m), 
it holds 

lim / ivtw +*/)&- 1 vwia, dm= /^ (9)lmi iZto±£fl!i^£ dm . 

£ ;o 7 e y r vy; £40 e 
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Proof. Let us consider the first equality. Notice that it is invariant under addition of 
constants to <j> and multiplication of <j) by positive constants, hence if <fi is affine the thesis 
is obvious. In addition, since (V/ij^ = m-a.e. in all level sets /i _1 (c), the formula holds 
m-a.e. on any level set of g. Then, by locality, the formula holds if (f> is countably piecewise 
affine, i.e. if there is a partition of J in countably many intervals where eft is affine. In the 
general case, thanks to the C 1 regularity of (j), for any 5 > we can find a countably piecewise 
affine (f>$ such that 11(0 — (j)s)'\\oo < <5 an d use the estimate 



||V(/ + e0^))|«,-|V(/ + e^))|„| <e|V(0-^)( ff )U <eS\Vg\ w 

to conclude the proof of the first equality. 

The second integral equality immediately follows by the integrating the first one, pulling 
the limit out of the integral in the left hand side using the dominated convergence theorem 
and finally using the identity 

Q{u + ev) - Q(u) - e 2 Q(v) = Q(v + eu) - Q(v) - e 2 Q{u) (4.13) 

(with u = 4>(g), v = f) satisfied by all quadratic forms Q. □ 
We are finally ready to prove the main result of this section. 

Theorem 4.8 (Derivative of the entropy along a geodesic) Let (X, d,m) G X be a strong 
CD(K, oo) space and let o"o, o~\ be •probability densities. Assume that o~\ is bounded with 
bounded support as in (3.3), and let if be a Kantorovich potential from corn to aim. Then: 

(a) if cp G L 2 (X, m), logcro G D(Ch) and 

Po{x) < ce~ 9K ( D ( :r ) _c ') whenever D(x) := dist(x, supp pi) > R for some c, C, R, 

(4.14) 

then for any 7r G GeoOpt(o"otn, aim) as in Definition 3.1 and pt = (e*))! 71 " it holds 

lim . nf Ent m (p t ) ~ Ent m (/, ) Ch(yQ - Ch(y> + ea ) ^ > 
40 t £ 

(b) ifO<c<aQ<C<oo for some constants c, C, then 

Ent m ( f T 1 m)-Ent m (aom)-^W 2 2 (a m,a 1 m) > Ch (^) ~ Ch(^ + ea ) ^ > Q ^ ^ 

Proof, (a) Since logoo G D(Ch) and o"o is bounded we have Ch(oo) < oo. By Proposition 3.4 
we get that (restr^ 3 )^ is a test plan. 

Now observe that the convexity of z i— > z log z gives 

Ent m (/i t ) - Ent m (/z ) . /" , 0t - cr /" log(cr o e t ) - log(<r o e ) , , 

> / log do dm = / d7r. v 4 - 1 ') 



t ~ J t J t 

Here we make the fundamental use of Lemma 4.5: take / := log do, g := <p and notice that 
thanks to Proposition 3.5 and the second part of Lemma 4.4 applied with 7r, the assumptions 
of Lemma 4.5 are satisfied by 7r. Thus we have 

liminf ( lQg ^° - ^ - lQg ^° - dTr > / l V(/? l^ 70 ^ ~ l V (^ + gl °g (7 o)^(7o) d? / n 



40 J t J 2e 

|2 



|!v*HV| ±£ iog^ odm (41g) 
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Since log<7o £ -D(Ch), the integrand in the r.h.s. of (4.18) is dominated, so that Lemma 4.7 
yields 



Km /■ivA-iv(^iog*o)s godm = /■ lin iv^-iv( y+£ iog.,))a (rodm 

40 J £ J £4.0 £ 

lim |V^-|V(^^o)l^ dm 

£4.0 £ 

im / |V^-|V(^^o)l^ dm 



e4.0 J £ 

2 lim Ch ^^ - Ch ^ + ga °) 

e\o £ 

Now the convexity of Ch together with (4.17) and (4.18) proves (4.15). 

(b) Observe that by Proposition 3.5 we know that J |V<^|^ d/^o < 00, so that the lower bound 
Mo > cm gives |Vy| w G L 2 (X,m) and the statement makes sense. We can also assume 
Ch(o"o) < 00, indeed if not the inequality |V(</2 + £<jq)\ w > £|Vo"oU — |V<^|^ implies Ch(<^ + 
£(Jo) = 00 and there is nothing to prove. 

Let D(x) := d(x, supp a\) and h n : [0,oo) — > [0, 00) be given by 

hn (r) :=e -M-((r-n)+)\ 

Define the cut-off functions X n (x) ■= h n (D(x)), and notice that since the h^s are equi- 
Lipschitz, so are the X n 's. 

Notice that X n t 1 m X as n ~ ► °°- Define 7r n := c n (X n eo) n £ GeoOpt(crotn, (Xittt) 
as in Definition 3.1 and c n 1 1 being the normalizing constant, and /if := (ei)jj7r n = crfm. 

We claim that Ch(</7 + £<Jq) converges to Ch((p + eao) as n — > 00. To prove the claim, let 
L be a uniform bound on the Lipschitz constants of the X n and notice that the inequality 
(2.21) yields |V(X n co)U < X n \Vao\w +0o|VX n U < |V<7o| w + LUcjUoo, so that the sequence 
(iV(XnO'o)U) is dominated in L 2 (X, m). Now just observe that by the locality principle (2.18) 
we have |Vao|«, = |V(XnCo)U m-a.e. in {X n = 1}- 

Taking the previous claim into account, by Lemma 4.6 we have that Ent m (/i") — > Ent m (/^) 
for any t € [0, 1], so that 

K 

Ent m (crim) - Ent m (cr m) - —W 2 (a m.,a 1 m) 

K 

= lim Ent m (cf m) - Ent m (<7?m) W|(a?m, of tn) 

n— >oo 2 

> lim inf Um i n f E^ W .n)-Ent„Wn,) > Ch( y ) - Chfa, + 



Ch(y) - Ch(y + ecr ) 

£ 

for all £ > 0, provided that statement (a) is applicable to (cr^m, of m). 

To conclude, we prove that <7q satisfies the assumptions made in (a). Indeed, (4.14) is 
satisfied by construction (with c = 2||o"o||oo> C — R = n ), thanks to 

o»(x) = c n X n (x)<T (x) < 2||a- ||ooe- 9 ^" (D(x) - n)2 whenever D(x) > n, 
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for n large enough to ensure c n < 2. In addition, the inequality |VlogX n |(x) < 18K D(x) 
gives 

|Vlogo-^| w (x) < |VlogXnU(x) + \V\oga \ w (x) < l8K~D(x) + ^\Va \ w (x), 
so that \Vlogafi\ w G £>(Ch). □ 

4.3 Quadratic Cheeger's energies 

Fix a metric measure space (X, dm) £ X; without assuming any curvature bound, in this 
section we apply the tools obtained in Lemma 4.5 to derive useful locality and structural 
properties on the Cheeger energy in the distinguished case when Ch is a quadratic form on 
L 2 (X, m). Since Ch is 2-homogeneous and convex, this property is easily see to be equivalent 
to the parallelogram identity (see for instance [12, Proposition 11.9]) 

Ch(/ + 5 ) + Ch(/- 5 ) = 2Ch(/) + 2Ch(<7) for every /, g G L 2 (X, m). (4.19) 

If this is the case we will denote by £ the associated Dirichlet form with domain £>(£) := 
W 1,2 (X, d, m), i.e. £ : [D(£)] 2 — > R is the unique bilinear symmetric form satisfying (see e.g. 
[12, Prop. 11.9]) 

£(/,/) = 2Ch(/) V/G ^ 2 (X,d,m). 

Recall that W^ 2 {X,d,m) = D(Ch) n L 2 {X, m). 

We will occasionally use this density criterion in the theory of linear semigroups. 

Lemma 4.9 (Density of invariant sets) Let £ be the bilinear form associated to a non- 
negative and lower semicontinuous quadratic form Q in a Hilbert space H , and let St be the 
associated evolution semigroup. If a subspace V C D(Q) is dense for the norm of H and 
St-invariant, then V is also dense in D(Q) for the Hilbert norm wE{u, u) + (it, u) 2 . 

Proof. If u £ D(E) satisfies E(u,w) + (u,w) = for all w G V, we can choose w = Stv, 
v G V, and use the fact that St is self-adjoint to get 

E(S t u, v) + (S t u, v) = £(tt, S t v) + (u, S t v) = V« G V, t > 0. 

Since Stu belongs to the domain of the infinitesimal generator of Stu, v >— > £.(Stu,v) is con- 
tinuous in -D(£) for the H norm, hence E(StU,v) + (Stu,v) = for all v G D(E) and t > 0. 
Choosing v = Stu and letting 1 1 gives n = 0. □ 

Proposition 4.10 (Properties of W l,2 (X, d, m)) //Ch is quadratic inL 2 (X,m) then W l,2 {X, d, m) 
endowed with the norm yjl/IH + £(/> /) is a separable Hilbert space and Lipschitz functions 
are dense. 

Proof. We already know that M^ 1,2 (X, d, m) is complete [2, Remark 4.7] and therefore it is a 
Hilbert space since Ch is quadratic. In particular if f n , f G W 1,2 (X, d, m) satisfy \\f n — /||2 — > 
and Ch(/ n ) — > Ch(/) then / n — > / strongly in M^ 1,2 (X, d, m). In fact, by the parallelogram 
identity and the L 2 (X, m)-lower semicontinuity of Ch 

hmsupCh(/ - f n ) = limsup (2Ch(/) + 2Ch(/ n ) - Ch(/ + /„)) 

= 4Ch(/) - liminf Ch(/ + /„) < 4Ch(/) - Ch(2/) = 0. 
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The density of Lipschitz function thus follows by (2.22). The separability of W 1,2 (X, d, m) 
follows considering the invariant set V := Ut>o ^tL 2 (X, m), which is a subspace thanks to the 
semigroup property, dense in W 1 ' 2 (X,d,m) thanks to Lemma 4.9. Using (2.34) and the sep- 
arability of L 2 (X, m) it is easy to check that V is separable with respect to the W 1,2 {X, d, m) 
norm, whence the separability of W 1,2 (X, d,m) follows. □ 

The terminology Dirichlet form, borrowed from [16], is justified by the fact that £ is closed 
(because Ch is L 2 (X, m)-lower semicontinuous) and Markovian (by the chain rule (2.20)). 
Good references on the theory of Dirichlet forms are [16, 17] for locally compact spaces and 
[28]. The second reference (but see also [17, A.4]), where the theory is extended to infinite- 
dimensional spaces and even to some classes of non-symmetric forms is more appropriate for 
us, since we are not assuming local compactness of our spaces. 

In this section we analyze the basic properties of this form and relate the energy measure 
[/] appearing in Fukushima's theory, a kind of localization of £, to [V/l^,. Recall that for 
any / £ -D(£) D L°°(X, m) the energy measure [/] (notice the factor 1/2 with respect to the 
definition of [17, (3.2.14)]) is defined by 

[/](¥>) :=-£(/, fcp) <p) for any <^ G D(£) n L°°(X, m). (4.20) 

We shall prove in Theorem 4.18 that [/] = |V/|^m. The first step concerns locality, which is 
not difficult to prove in our setting: 

Proposition 4.11 £ is strongly local: 

f,g€ D(E), g constant on {/ + 0} => £(/, g) = 0. (4.21) 
Proof. By definition we have 

2£(/,<7) = J \V(f + g)\ 2 w -\Vf\ 2 w -\Vg\ 2 w dm. (4.22) 

By the assumption on g, locality (2.18) and chain rule (2.20) we get that m-a.e. on {/ ^ 0}. 
|V(/ + g)\ w = | V/|io and [Vgl^ vanishes. On the other hand, m-a.e. on X \ {/ / 0} we have 
|V(/ + ff)U = \Vg\ w and |V/|u, vanishes. □ 

The identification of [/] with | V/|^m requires a deeper understanding of the Leibnitz formula 
in our context. Our goal is to prove the existence of a bilinear symmetric map from [L>(Ch)] 2 
to L 1 (X, m), that we will denote by V/ • Vg, which gives a pointwise representation of the 
Dirichlet form, in the sense that 

£(/,5) = J V/ • Vgdm V/, g G W^pT^m). 

In the spirit of Cheeger's theory [8], we may think to differentials of Lipschitz functions as L°° 
sections of the cotangent bundle to X. Then, if we keep / fixed, the map g i— > V/(x) • Vg(x) 
may be interpreted as the action of df on the section induced by g, and we may use this map 
to define the gradient of /. 

We remark that V/ • Vg could be considered as the "carre du champ" operator T(f,g) 
of T-calculus in this context, widely used in the study of diffusion semigroups. We adopted 
the appealing notation V/ • Vg since in our approach this quantity is directly obtained by 
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a pointwise "vertical" differential of the squared weak upper gradient (like when one tries 
to recover a scalar product from the squared norm). As a byproduct, we will show that the 
Laplacian satisfies a suitable formulation of the diffusion condition [5, 1.3], so that useful 
estimates can be derived by the so called T-calculus. An example of application will be given 
in Section 6.2. 

Let f,g€ -D(Ch) and notice that the inequality 

|V((1 - A)/ + Xg)\ w < (1 - A)|V/U + X\Vg\ w m-a.e. in X, 

valid for any A G [0, 1], immediately yields that e h-» |V(/ + £g)|^ satisfies the usual convexity 
inequality m-a.e. and e H> e _1 [|V(/ + eg)\^ — |V/|^] is nondecreasing m-a.e. in R \ {0}, in 
the sense that 



|V(/ + e' 5 )i; 



iv/fc < |v(/ + gg)l 



|v/| 



m-a.e. in X for e', e G R \ {0}, e' < e. 



e' e 
Definition 4.12 (The function V/ • Vg) For f,g€ D(Ch) we define V f • Vg as 

\V(J + eg)\l-\Vf\\ 



V/ • Vg := lim ■ 

where the limit is understood in L 1 (X, m). 



U) 



2s 



(4.23) 



Notice that by monotone convergence and the lower bound obtained by taking a negative 
e' in the previous monotonicity formula, the limit in (4.23) exists in L 1 (A, m) along any 
monotonically decreasing sequence (ej) C (0,oo). This obviously implies existence of the full 
limit as e ! 0; we also have 



v/-v/ = |v/|; 



m-a.e. in X. 



(4.24) 



Notice also that we don't know, yet, whether (/, g) i— > V/ • Vg is symmetric, or bilinear, the 
only trivial consequence of the definition being the positive homogeneity w.r.t. g. Now we 
examine the continuity properties of V/ • Vg with respect to g. 

Proposition 4.13 For f,g,g^ D(Ch) it holds 

| V/ • Vg - V/ • Vg\ < \Vf\ w \V(g - g)\ w m-a.e. in X 

and, in particular, V f • Vg G L 1 (A, m) and 

|V/ • Vg\ < \Vf\ w \Vg\ w m-a.e. in X. (4.25) 

Proof. It follows from 

I V(/ + eg)\ 2 w - \Vf\l) - (| V(/ + e~g)\l - \V f\ 2 w ) | = | |V(/ + eg)\ 2 w - \V(f + eg)\ 2 u 

|V(/ + eg)\ w - | V(/ + eg)\w) (|V(/ + eg)\ w + \V(f + e~g)\ w ) 
< e\V(g - ~g)\ w (|V(/ + eg)\ w + |V(/ + eg)\ w ), 

dividing by e, letting e 1 and using the strong convergence of \V {f + eg)\ w and \V(f + eg)\ w 
to |V/|„. □ 
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Observe that the second chain rule given in Lemma 4.7 grants that 

J V(0 o /) • Vg dm = J (4>' o /) V/ • Vg dm (4.26) 

for (j) nondecreasing and C 1 on an interval containing the image of /. 
Proposition 4.14 For any /, g G -D(Ch) it holds 

£(/,<?) = I V/-V 9 dm. (4.27) 



Also, we have 

Vf-Vg = -V(-f)-Vg = -Vf-V(-g) = V(-f)-V(-g) m-a.e.mX. (4.28) 

Proof. The equality (4.27) follows replacing g by eg into (4.22), dividing by e and letting 
e i 0. To get (4.28) notice that the m-a.e. convexity of e ^ |V(/ + e<7)|^(x) in R yields 

V/- V# + V/- V(-s) > m-a.e. in X. (4.29) 

Since £(/,<?) = — £(/, —5) we can use (4.27) to obtain that the sum in (4.29) is null m-a.e. in 
X. We conclude using the identity V/ • V<? = V(— /) • V(— g), a trivial consequence of the 
definition. □ 

Lemma 4.15 Let u G D(Ch) be a given bounded function and let £^(7) be defined as in (4.5). 
Then there exists a test plan 7r satisfying (eo)ij"7r = m and 

ton = Km " ° e ° ~ u ° et = \ Vu \ w e in L 2 (AC 2 ([0, l];X),7r). (4.30) 

Proof. Let po := ce", where c is the normalization constant, put po := pom and /?t := H t (po). 
Notice that po is uniformly bounded away from and 00 and that Ch(pt) — > Ch(po) as t J, 
implies, by the same Hilbertian argument of Proposition 4.10, strong convergence of iVptl^ 
to \Vpo\ w in L 2 (X,m). Define the functions A t , B t , C u A : AC 2 ([0, l];X) ->■ R by (with the 
usual convention if Et("f) = 0) 



^(7) 
A(7) 

Q(7) 



logPo(7o) ~ logPo(7t) = ^(7o) ~ ^(7t) 

logPo(7o) ~ logPo(7t) = ^(7o) ~ u ilt) 
Et(l) Bkin) ' 

Etirr) 
t ' 



i /' |V«&(7,) ds 1 
* ./o Po(7.) V * -A) 



Now use [25] to get the existence of a plan tv G ^(AC 2 ([0, 1]; X)) such that (et)jj7r = pt '■= Pttn 
for all i G [0, 1] and 

/" |7t| 2 d7r( 7 ) = \p t \ 2 for a.e. t G [0, 1]. 
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The maximum principle ensures that 7r is a test plan. 

By Lemma 4.16 below we get that Dt — > \Vu\ w o eo in L 2 (AC 2 ([0, 1];X),tt). From the 
second equality in (2.27) we have 



(4.31) 



„ „2 1 [ f\ ,2 1 f\ ,2 

lim \\Ct o = lim - / / 7s ds chr = lim - / \h s \ ds 
40 40 t J Jo 40 t Jq 

= hm- / / dmds= / dm = VttL o eo o, 

40 t J J p s J Po 

and from Lemma 4.4 and (4.8) we know that 

\Bt\ < Dt and limsup ||-Bt||| < II iVit^ o eo|||- (4.32) 

40 

Estimates (4.31) and (4.32) imply 



limsup / At d7r = lim sup / BtCt d7r < limsup / |2?t|Cfd7r 
40 J 40 7 40 



< limsup 1 1 -B* 1 1 2 1 1 C* 1 1 2 < |||Vu| w o eo|| 2 - 
40 



(4.33) 



Notice that from the convexity of z H > z log z we have 

A , f logpo(po - Pt) , . Ent m (/i ) - Ent m (/i t ) 

/If Cl7r = / CL7T -> , 

J t - t 

and from the first equality in (2.27) we deduce 

lim Ent m (^)-Ent m(Mt ) = ^ 1 ft JVp^ ^ = f \^ dm = |||v< Q ^ 



40 t 40 t Jo J p s J Po 

Thus from (4.33) we deduce that f A t d7r converges to || |Vu|^ o eo||| as t J, 0. Repeating now 
(4.33) with liminf we deduce that also \\Bt\\\ converges as 1 1 to |||Vu| w o eo 1 1 1 ■ Now, this 
convergence, the first inequality in (4.32) and the L -convergence of Dt to o e yield 

the L 2 -convergence of \Bt\ to the same limit. 

Finally, from the fact that the first inequality in (4.33) is an equality, we get that also Bt 
converges to | | w o eo in L 2 (tv). Also, since the second inequality in (4.33) is an equality 
and (4.31) holds one can conclude that Ct converges to |Vn|„, o eo in L 2 (tv) as well. 

Thus 7r has all the required properties, except the fact that (eo){j7r is not m. To conclude, 
just replace tv with cp^ 1 o eo7r, c being the renormalization constant. □ 

Lemma 4.16 Let f £ L 1 (X,m) be nonnegative and define F t : AC 2 ([0, 1]; AT) — > [0,oo], 
t€ [0,1], by 

Ft(l) :=J\ ff^)ds t 6(0,1], Fo-vTCtoT- 

Then Ft — > Fq in L 2 (AC 2 ([0,1];X),tt) ast 10 for any test plan 7r whose 2-action j j |7t| 2 dtd7i 
is finite. 
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Proof. To prove the thesis it is sufficient to show that F 2 — > Fq in L 1 (AC 2 ([0, 1]; X), 7r). 

Now assume first that / is Lipschitz. In this case the conclusion easily follows from 
the inequality \Ff{^) — Fq < Lip(/)i ^^(70,7.5) ds < Lip(/) J * |7 S | ds and the fact that 
J L \jt\ 2 dtdiv(j) < 00. To pass to the general case, notice that Lipschitz functions are dense 
in L l {X, m) and conclude by the continuity estimates 

J- t J\(j s )ds div<- t J^ J |M7.)|dird*<C7||/i||i, J \h{Ta)\6*<C\\h\\x, 



where C > satisfies (et)^ < Cm for any t G [0, 1]. 



□ 



Proposition 4.17 (Leibnitz formula for nonnegative functions) Let /, g, h G D(Ch)n 

L°°(X, m) with g, h nonnegative. Then 

£(f,9h) = J Vf -V(gh) dm = J hVf ■ Vg + gVf ■ Vh dm. (4.34) 

Proof. Notice first that if (g n ), (h n ) are equibounded and converge in W 1,2 {X, d, m) to g, h 
respectively then (2.21) ensures that g n h n converge to gh strongly in W 1,2 (X, d, m). Hence, 
taking Proposition 4.10 and Proposition 4.13 into account, we can assume with no loss of 
generality g, h to be bounded, nonnegative and Lipschitz. 

Now we apply Lemma 4.15 with u = f. The definition of V/ • V(g/i) and inequality (4.10) 



gives 



V/ ■ V( S Ji) dm 



_ liminf /lv(/ + ^)&-lv/£ dm 

£ j.o J e 

^ ,. f g{io)Hio) - g{it)h(j t ) , , \ 

> hmsup / d7r(7). 

40 J t 



Now observe that the convergence 

(s(7t) - 5(7o)) {K-Jt) ~ Hlo)) 



< Lip(g) Lip(h) 



d 2 (7o,7t) 



inL^vr), 



ensures that 



limsup ; g{i*)Ki Q )-g{it)K lt ) A<i) > Hm . nf r g(7o)M7o) - g{it)Ki t ) ^ 



1 10 



t 



40 



t 



>liminf / g (7o) /l(7 ° ) " M7f) d 7 r(7) + liminf / h^) 9 ™ ~ d 7r( 7 ). 



Now applying inequality (4.10) to the plans (g o eo) 7r and (h o eo) 7r we get 

Uminf /- g(7o) ^.)-M7,) d „ (7) > Umsup r iv/s-iv(/-<r*)S drot 

40 J t eiO J £ 

hminf f h( 70 ) 9M ~ 9M dn( 7 ) > limsup [ J^lg, - |V(/ - dffl 
40 7 * £4.0 7 e 
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Recalling the convergence in L (X, m) of the difference quotients in (4.23) we get 

limsup j g \ V f\™-\ V ^- £h ^ & m = -J g vf. V(-/i)dm = JgVf- Vhdm, 
limsup J fe lZZmy ~ e 9)\l dm = _ y w/ . V (-(7)dm = y W/ ■ V#dm. 

Thus we proved that the inequality > always holds in (4.34). Replacing / with — / and using 
(4.28) once more we get the opposite one and the conclusion. □ 

Theorem 4.18 (Leibnitz formula and identification of [/]) Let (X, d,m) S X and let 

us assume that Cheeger's energy Ch is quadratic in L 2 (X,m) as in (4.19). Then 

(i) The map (f,g) i— > Vf-Vg from [.D(Ch)] 2 to L 1 (X,m) is bilinear, symmetric and satisfies 
(4.25). In particular it is continuous from [W 1,2 (X, d, m)] 2 to L l (X,m). 

(ii) For all f,g£ D(Ch) it holds 

|V(/ + g)\l + |V(/ - = 2|V/| 2 + 2\Vg\ 2 w m-a.e. m X. (4.35) 
In particular Ch is a quadratic form in L 1 (X, m). 
(Hi) The Leibnitz formula (4.34) holds with equality and with no sign restriction on g, h. 
(iv) The energy measure [/] in (4.20) coincides with |V/|^m. 

Proof. The continuity bound follows at once from (4.25). In order to show symmetry and 
bilinearity it is sufficient to prove (4.35) when f,g€ -D(Ch). In turn, this property follows if 
we are able to prove that 

f ^ j h\Vf\ 2 w dm is quadratic in D(Ch) (4.36) 

for all h G L°°(X, m) nonnegative. Since D(Ch) n L°°(X, m) is weakly* dense in L°°(X,m), 
it is sufficient to prove this property for nonnegative h S D(Cb) n L°°(X,m). Pick / = g £ 
.D(Ch) n L°°(X,m) nonnegative in (4.34) to get 

J h\Vf\ 2 w dm = - j fVf ■ Vhdm + J V/ • V(//i) dm 

= ~y V(y) • V/idm + y V/- V(//i)dm 

= -S(y,/i) + £(f,fh), 

having used equation (4.26) in the second equality. Now, splitting / in positive and negative 
parts, we can extend the formula to -D(Ch)nL°°(X, m), since £ is bilinear and the strong local- 
ity ensures E(f + ,f~h) = 0, £(/", f+h) = 0. Both maps / £(/ 2 /2, h) and / ^ £(/, fh) are 
immediately seen to be quadratic, so the same is true for j \\7 fl^hdm. Thus we proved that 
the map J h\V fl^ dm is quadratic on D(Ch) n L°°(X, m). The statement for the full domain 
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D(Ch) follows from a simple truncation argument: if f N := max{min{/, JV}, — iV}, g N 
max{min{g, N}, — N} £ L 2 (X,m) are the truncated functions, the chain rule (2.20) gives 

h\V(f N + g N )\i dm + J h\V(f N -g N )\ 2 w dxn<2 J fc|V/|*dm + 2 j h\Vg\ 2 w . 



which yields in the limit one inequality. A similar argument applied to f+g and f—g provides 
the converse inequality and proves (4.36). Finally, we can use the fact that h is arbitrary to 
prove the pointwise formulation (4.35). □ 

The property (4.36) shows that if (X, d,xn) gives raise to a quadratic Cheeger's energy, 
also (X,d,hxn) enjoys the same property, provided a uniform bound < c < h < c _1 is 
satisfied (indeed, we can use [2, Lemma 4.11] to prove that |V/|iu is independent of h). The 
next result consider the case when h is the characteristic function of a closed subset of X. 

Theorem 4.19 Let (X,d,xn) € X and let Y C X be a closed set of positive measure. For 
f : Y — )■ R, denote by \Vf\ W Y the minimal weak upper gradient of f calculated in the metric 
measure space (Y, d, m(y) _1 m| y ). Then: 

(i) Let f : X — >■ R Borel and Sobolev along a.e. curve of X, and define g : Y — >■ R by 
g := fly. Then g is Sobolev along a.e. curve ofY and \Vf\ w = \Vg\ W: y xn-a.e. in Y. 

(ii) Let g : Y — > R be Borel, Sobolev along a.e. curve in Y and such that dist(supp g, dY) > 
0. Define f : X — >■ R by /| y := g and f\ x ^ Y := 0- Then f is Sobolev along a.e. curve 
of X and \V f\ w = Xy\^g\w,Y xn-a.e. in X. 

(Hi) Lf moreover Ch is a quadratic form in L 2 (X,m) according to (4.19) and m(dY) = 0, 
then 



CM/) := 



/ l^flwY^xny if f is Sobolev along a.e. curve inY, 
* Jy 



+oo otherwise, 

is a quadratic form in L 2 (Y, my). 

Proof. (i) The fact that g is Sobolev along a.e. curve in Y is obvious, since this class of 
curves is smaller. It is also obvious that |Vg| W) y < (V/l^, m-a.e. in Y, so that to conclude it 
is sufficient to prove the opposite inequality. Let G : X — > [0, oo] be defined by 

G(x) := ( |V ^' y(x) if + t Gy : (4.37) 

[ +oo otherwise. 

Then it is trivial from the definition that G is a weak upper gradient for / in X. Thus, the 
fact that |V/| w is the minimal weak upper gradient gives that jV/ju, < G m-a.e. in X, which 
is the thesis. 

(ii) From the hypothesis that g is Sobolev along a.e. curve in Y and supported in a set having 
positive distance from dY it follows that / is Sobolev along a.e. curve in X. To prove this, 
if C denotes the support of g, notice first that for any absolutely continuous curve 7 the set 
L r := {t G [0, 1] : dist(7t, C) = r} is finite for a.e. r (if jt is Lipschitz we can apply the coarea 
inequality, see for instance [14, Corollary 2.10.11], in the general case we can reparameterize 
7). Now, setting R := dist(C, dY) > and choosing r G (0, R) such that L r is finite, we can 
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use this set of times to split 7 in finitely many curves contained in Y and finitely many ones 
not intersecting C. The equality (V/l^, = Xy\Vg\ W; Y tn-a.e. in X then follows by locality (in 
X\Y) and by (i) (in Y, since g = /| y ). 
(iii) Fix r > 0, define 

Y r :={xeY: d{x,dY)>r}, 

so that Y r f Y\dY as r 4, 0, and let X r ■ Y — > [0, 1] be a Lipschitz cut-off function with support 
contained in Y\Y r / 2 and identically equal to 1 on Y r . Notice that, since m(dY) = 0, to prove 
the quadratic property of Chy it is sufficient to prove, for all r > 0, that / H- J Y | V f\ 2 wY dmy 
is quadratic in the class of functions which are Sobolev along a.e. curve in Y . By the previous 
points and the locality principle (2.18) we know that 

|V/| w ,y = |V(/X P )|w = |V/U m-a.e. in Y r , 
so that the conclusion follows from (4.35) of Theorem 4.18. □ 



5 Riemannian Ricci bounds: definition 

Let (X, d,tti) G X. We say that (X, d,m) has Riemannian Ricci curvature bounded below by 
K (in short, a RCD(K, 00) space) if any of the 3 equivalent conditions of Theorem 5.1 below 
is fulfilled. Basically, one adds to the strong CD(K, 00) condition a linearity assumption on 
the heat flow, stated either at the level of M[ or at the level of H^. A remarkable fact is that 
all these conditions are encoded in the EVI^- property of the gradient flow. 

Before stating the theorem we observe that linearity at the level of will be understood 
as additivity, namely 

Jf t ((l- \)fi + \v) = (1 - \)J%(ji) + \J%(u) V/i, v€ &2(X,ra), Ae[0,l]. 

Theorem 5.1 (3 general equivalences) Let (X,d,m) G X. Then the following three prop- 
erties are equivalent. 

(i) (X, d,m) is a strong CD(K, 00) space and the semigroup J%?i on ,^2(X,m) is additive. 

(ii) (X, d, ttl) is a strong CD(K, 00) space and Ch is a quadratic form on L 2 (X,m) according 
to (4.19). 

(iii) (X, d,m) is a length space and any [i G &2(X, m) is the starting point of an EVI^ 
gradient flow o/Ent m . 

If any of these condition holds, the semigroups and J#t are also related for all t > by 

(H t /)m = J f(x)m(S x )dm(x) V/GL 2 (X,m), (5.1) 

meaning that the signed measure (H t /)m is the weighted superposition, with weight f(x), of 
the probability measures J%t(5 x ). 

Proof. (i) => (ii). The additivity assumption on the heat semigroup, the identification 
Theorem 2.17 and the 1-homogeneity of the heat semigroup in L 2 (X,m) (Proposition 2.18), 
easily yield that the heat semigroup Hi is linear in L 2 (X,m) as well (see the proof of (5.1) 
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below). This further implies that its infinitesimal generator —A is a linear operator, so that 
D(A) is a linear subspace of L 2 {X, m) and A : D{A) — > L 2 is linear. Now, given / G -D(Ch), 
recall that t i— > Ch(H t (/)) is continuous on [0, oo) and locally Lipschitz on (0, oo), goes to 
as t -»• oo and ^Ch(H t (/)) = -|| AH t (/)||| for a.e. t > (Theorem 2.12), thus 



/>oo 

Ch(/)= / ||AH t (/)||idt V/GD(Ch). 

JO 



Now, recall that quadratic forms can be characterized in terms of the parallelogram identity; 
thus Ch, being on its domain an integral of the quadratic forms / i-> ||AHt(/)|||, is a quadratic 
form. 

(ii) => {Hi). Using {iii) of Proposition 2.21, to conclude it is sufficient to show that J%t(fi) is 
an EVIa gradient flow for Ent m for any fi <C m with density uniformly bounded away from 
and infinity. Thus, choose fi = pm G &i{X) such that 0<c<p<C<oo and define 
fit '■= J0t(p) = Pttn. By Proposition 2.20, in order to check that {fit) is an EVIa' gradient 
flow it is sufficient to pick reference measures v in (2.30) of the form v = am G &2(X), with 
rf bounded and with bounded support. For any t > 0, choose 7r t G GeoOpt(//j, v) given by 
the strong CD{K, oo) condition and let <pt be a Kantorovich potential for (/it, v). Now, recall 
that Theorem 4.8(b) provides the lower bound 

Ent m („) - Ent m(Mt ) - ") > ^ ^ = ^ + ^ , 

thus to conclude it is sufficient to show that for a.e. t > it holds 

A V(^, ,) < lim + (5 . 2) 

di2 ; ~ £4-0 e v ' 

By Theorem 4.1 we know that for a.e. t > it holds 

dt 2 W * ^ "> " Sg i • (5 - 3) 

In order to obtain (5.2) from (5.3) we crucially use the hypothesis on Cheeger's energy: 
recalling (4.13), the fact that Ch is a quadratic form ensures the identity 

Ch(pt - eipt) - Ch{p t ) = ChQgt) - Ch(y? f + ep t ) + Q ^ 
(see (4.13)) and therefore (5.2) is proved. 

(m) => (i). By Lemma 5.2 below we deduce that {X, d,m) is a strong CD{K, oo) space. We 
turn to the additivity. Let (/4) be two EVIa gradient flows of the relative entropy and 
define fit '■= A/i° + (1 — X)fif, A G (0, 1). To conclude it is sufficient to show that {fit) is an 
EVIa gradient flow of the relative entropy as well. By (2.7) we know that {fit) C &i{X) is 
an absolutely continuous curve, so we need only to show that 

limgup e^Wi{p t+h ,u)-W 2 {fi,u) + ^ EnUi/)) ^ > Q (5 4) 

Thus, given a reference measure u, for any t > let -y t ^ Opt(^,z/), define := (74)^?, 
:= (7t)(j/4 (recall Definition 2.1 and notice that fi®,p\ <C = 7r»7t)- By equation (2.9) 



1 . 
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we have 



W2(ih,v) = j d 2 (x,y)d Jt (x,y) = j d\x, y) ((1 - A)^(x) + A^(s)) d 7t (x, y) 



= (1-A) | d 2 (x,y)^-(x)d 7t (x,y) + A | d 2 (x, v)^(x) d 7t (x, y) 

= (l-A)Wf(M t °,^) + AW|(/4,i/ t 1 ), 

while the convexity of yields 

WlW, i/) < (1 - A)W- 2 2 (/i? +fe! + AW|(^fc, ^ t ), Vfc > 0. 

Hence for any t > we have 

r ^W^W,!/)-^,!/) ^ .... e Kh Wi(tf +h ,v?)-Wj(rf,v?) 

hm sup < (1 — A) lim sup - 

hio 2n 2/i 

+Anmsup — . (5.5) 

hio 2/i 

Now we use the assumption that (/j,®) and are gradient flows in the EVIx sense: fix t 
and choose respectively and v\ as reference measures in (2.32) to get 





«?) 


- W|(M t ,^ ) 




2/i 










2h 



lim sup — < Ent m (u t ) - Ent m (fi t) , 

hio 



(5.6) 



lim sup t+ ' < Entn,^ 1 ) - Ent m (/4). 

HO 



Finally, from Proposition 2.3 we have 

Ent m Gut) - Ent m (zv) < (1 - A) (Ent ro (//°) - Ent m (^ )) + A^Ent TO (/xJ) - Ent m (^)). (5.7) 

Inequalities (5.5), (5.6) and (5.7) yield (5.4). 

Finally, we prove (5.1). By linearity we can assume that / is a probability density. Notice 
that the additivity of the semigroup Jf? t gives J%t(^2i &i&xi) = a i^t(^Xi) whenever a, > 0, 
^jfl,; = 1 and Xi £ suppm. Hence, if / is a continuous probability density in suppm with 
bounded support, by a Riemann sum approximation we can use the continuity of to get 



■m(fm) = J f(x)Jn{S x ) dm(x) 



and the identification of gradient flows provides (5.1). By a monotone class argument we 
extend the validity of the formula from continuous to Borel functions / G L 2 (X, m) . □ 

Lemma 5.2 Any (X, d, m) 6l satisfying condition (Hi) of Theorem 5.1 is a strong CD(K, oo) 
space. 
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Proof. Let us first prove that (X, d,m) is a CD{K, oo) space. We notice that, since by 
assumption (suppm, d) is a length space, then &2(X,m) is a length metric space. Therefore, 
up to a suitable reparameterization, for every fi , /xi £ D(Ent m ) C ^(X, m) and e > 
there exists a L e -Lipschitz curve (p 6 ) £ Lip([0, 1]; ^(X, m)) connecting po to fi± with L 2 < 
W 2 2 (// ,Aii) +e 2 - 

We can thus set fi e s := J^ £ (fi e ), where J4?t(p) denotes the /^-gradient flow starting from fi, 
so that by (2.36) of Proposition 2.22 we get 

K e 2 
Ent m (/2*) < (1 - s)Ent m (/i ) + sEnt m (/xi) - — s(l - s)W|(Mo, Mi) + 

Since e 2 /1k{s) — > as e 4- 0, the family {/i^} has uniformly bounded entropy and therefore it 
is tight. By (ii) of Proposition 2.21 we know that 

W2(Ar> AD ^ e _is:£ L e |r — s| for every r,s £ [0,1], e > 0. 

Since limsup e ^ ^e < ^(MOj^i)) we can applying the refined Ascoli-Arzela compactness 
theorem of [3, Prop. 3.3.1] to find a vanishing sequence e n i and a limit curve (/i s ) C 
&2(X, m) connecting /^o to /ii such that 

//* n ->• fi s in ^(X), W 2 (Mr,/i s ) < W 2 (fJ-o,fJ.i)\r-s\, Ent m (^ s ) < oo for every r, s £ [0,1]. 

It turns out that (p s ) is a geodesic in Z)(Ent m ) connecting to /xi and therefore Proposi- 
tion 2.22 shows that Ent m is i^-convex along fi s . 

The same Proposition shows that Ent m is i^-convex along any geodesic contained in 
3 g 2(X, m): in particular, taking any optimal geodesic plan 7r as the one induced by the 
geodesic obtained by the previous argument, Ent m satisfies the if-convexity inequality asso- 
ciated to 7Vp as in Definition 3.1, since all the measures fjLp,t belong to D(Ent m ). □ 



6 Riemannian Ricci bounds: properties 
6.1 Heat Flow 

In this section we study more in detail the properties of the L 2 -semigroup in a RCD(K, oo) 
space (X, d, m) £ X and the additional informations that one can obtain from the relation (5.1) 
with the WVsemigroup Jtfj.. First of all, let us remark that since Ch is quadratic the operator 
A is in fact the infinitesimal generator of H^, and therefore is linear. Furthermore, denoting by 
£(u, v) : [D(Ch)] 2 — > E the Dirichlet form induced by Ch, the relation £(u, v) = — f -uAudm 
for u £ D(A), v £ -D(Ch) implies that A is self-adjoint in L 2 (X, m) and the same is true for 
H t . 

Also, again by Proposition 2.21, and the definition of RCD(K, oo) spaces, we know that 
for any x £ suppm there exists a unique EVIr- gradient flow Jt? t (5x) of Ent m starting from 
5 X , related to H t by (5.1). 

Since Ent m (^(<5 x )) < oo for any t > 0, it holds Mt(bx) <C m, so that J£j(<5 x ) has a density, 
that we shall denote by pt[x]. The functions pt[x](y) are the so-called transition probabilities 
of the semigroup. By standard measurable selection arguments we can choose versions of 
these densities in such a way that the map (x, y) i— >■ p t [x](y) is m x m-measurable for all t > 0. 
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In the next theorem we prove additional properties of the flows. The information on 
both benefits of the identification theorem: for instance the symmetry property of transition 
probabilities is not at all obvious when looking at 3% only from the optimal transport point of 
view, and heavily relies on (5.1), whose proof in turn relies on the identification Theorem 2.17 
proved in [2]. On the other hand, the regularizing properties of Ht are deduced by duality by 
those of J#t, using in particular the contractivity estimate (see (2.33)) 

W2(J%0i),J%(y))<e- Kt W2(ji,v) t >0, fi, ue &> 2 (X,m). (6.1) 
and the regularization estimates for the Entropy and its slope (apply (2.34) with z := m) 

Ijc(t)Ent ro (^(A*)) + (I ^ ))2 |V"Ent m | 2 (^(/i)) < \w^m). (6.2) 

Notice also that (6.1) yields Wi(J#t(6 x ),J#t(8 y )) < e~ Kt d(x,y) for all x, y G suppm and 
t > 0. This implies that RCD(K, oo) spaces have Ricci curvature bounded from below by K 
according to [31], [21]. Notice also that using (5.1), the identification of gradient flows and a 
simple convexity argument, we can recover the inequality 

Wi{M{n\J^ t {y)) < e- Kt Wi(n,v) 

first with when p,, v G ^2(^;Tn) have densities in L 2 (X, m) and then, by approximation, in 
the general case when p, v G &i(X,m). 

Notice also that, as a consequence of [33, Theorem 1.3] and the density of Lipschitz 
functions one has the weak local (1, l)-Poincare inequality 

\u — -u| dm < 4r J \X7u\ w dm with u := — j—. [ udm 

B r (x) JB 2 r(x) m(B r (x)) J Br (x) 

for all x G suppm, r > 0, u G W l,2 {X, d, m), which implies the standard weak local (1,1)- 
Poincare inequality under doubling assumptions on m. 

Further relevant properties will be obtained in the next section. 

Theorem 6.1 (Regularizing properties of the heat flow) Let (X, d,m) G X be aRCD(K, 
space. Then: 

(i) The transition probability densities are symmetric 

Pt[ x ](y) = Pt[y]{x) m x m-a.e. in X x X, for all t > 0, (6.3) 
and satisfy for all x G X the Chapman- Kolmogorov formula: 

p t+s [x](y) = / pt[x](z)p s [z](y)dm(z) for m-a.e. y G X, for all t, s > 0. (6.4) 



(ii) The formula 

Ht/(x) := j f{y)d,m{8 x ){y) x G suppm (6.5) 

provides a version of Ht/ for every f G L 2 (X, m), an extension of Ht to a continu- 
ous contraction semigroup in L 1 (X, m) which is pointwise everywhere defined if f G 
L°°{X, m). 
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(Hi) The semigroup H( maps contr actively L°°(X,m) in Cft(suppm) and, in addition, Htf(x) 
belongs to C&((0, oo) x suppm). 

(iv) If f : suppm — > M. is Lipschitz, then Htf is Lipschitz on suppm as well and Lip(Hj/) < 
e- Kt Up(f). 

Proof, (i). Fix f,g€ Cb(X) and notice that (5.1) gives 

<?H 4 /dm = J f{x) J gdJ%(5 x )dm(x) = J J f(x)g(y)p t [x](y) dm(y) dm(x). 

Reversing the roles of / and g and using the fact that is self-adjoint it follows that 
J J(pt[x](y) — pt[y](x))f(x)g(y) dm dm vanishes, and since / and g are arbitrary we ob- 
tain (6.3). Formula (6.4) is a direct consequence of the semigroup property J%t +S (5 x ) = 
J^i(Jf s (S x )). 

(ii) Using the symmetry of transition probabilities, for / G L l (X, m) nonnegative we get 
l|Ht/||i = J J f{y)Pt[y]( x ) dm(x) dm(y) = ||/||i. By linearity this shows that H t is well defined 
m-a.e. and defines a contraction semigroup in L 1 (X, m). The fact that the right hand side in 
(6.5) provides a version of H t f follows once more from (5.1) and the symmetry of transition 
probabilities. 

(iii) Contractivity of H t in L°°(X, m) is straightforward. By (6.1) we get that JfP s {5y) — > J&t(5 x ) 
in duality with Cf,(X) when y — > x in X and s — > t. Also, the a priori estimate (2.34) shows 
that (t, y) i — y Ent m (J^j(5j / )) is bounded on sets of the form (e, oo) x B, with B bounded and 
e > 0. Thus the family {p t [y\} y& B,t>e is equi-integrable. This shows that p s [y] — > Pt[%] weakly 
in L l (X, m) when (y,s) — > (x,t) G X X (0,oo) and proves the continuity of H t f{x). 

(iv) By (6.5) we get \H t f(x)-H t f(y)\ < Lip(/)Wi(J^(4), J%{6 y )) < U V (f)W 2 {3%{8 x ) , J%(8 y )) . 
We can now use (6.1) to conclude (see [23] for a generalization of this duality argument). 

□ 

Using Lemma 2.8 below we can refine (iv) of Theorem 6.1 and prove by a kind of duality 
argument [23] a Bakry-Emery estimate in RCD(K, oo) spaces. 

Theorem 6.2 (Bakry-Emery in RCD(K, oo) spaces) For any f G D(Ch) and t > we 

have 

|V(H t /)|2 < e- 2Kt H t (\Vf\l ) m-a.e. m X. (6.6) 

In addition, if\Vf\ w G L°°(X,m) and t > 0, then e~ Kt (H t \V f\l) 1 is an upper gradient of 
H t f on suppm, so that 

|VH«/| < e- Kt (H t \Vf\l) 1/2 pointwise in suppm, (6.7) 

and f has a Lipschitz version f : X — )■ R, with Lip(/) < IHV/ltolloo- 

Proof. With no loss of generality we can assume, by a truncation argument, that / G 
L°°(X,m). 

Given / : X — > R Lipschitz and x, y G suppm, let 7 S G AC 2 ([0, 1]; suppm) be connecting 
x to y. Given t > 0, we can then apply Lemma 2.8 with p s := J^(5 7s ) to get 



|H t /(x)-H t /(y)| 



J fdJfi(5 x )- J fd<m(b y ) < J\J |V/| 2 d^) 1/2 |As|d S (6.8) 

< e-K* f\l \Vf\ 2 dp s ) 1/2 \%\ds = e - Kt (\H t (\Vf?)( ls )) l % s \ds 
Jo J Jo 
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(in the last inequality we used the contractivity property, which provides the upper bound on 
\fi s \)- Notice that we can use the length property of suppm to get, by a limiting argument, 

\H t f(x)-H t f(y)\ <d(x,y)sup{e-^(H t (|V/| 2 )) 1/2 (z) : d(z, x) < 2d(x, y)} (6.9) 

for all x, y G suppm. Taking the continuity of H^|V/| 2 into account, this implies the Lipschitz 
Bakry-Emery estimate 

|VHJ| 2 < e- 2m H t |V/| 2 in suppm. 

To prove (6.6) for functions / G D(Ch) we approximate / in the strong W ' topology by 
Lipschitz functions f n in such a way that |V/„| ->■ |V/| M in L 2 (X,m) and use the stability 
properties of weak upper gradients. 

Now, assume that L := |||V/| w ||oo is finite. From (6.6) we obtain that for all t > 
the continuous function f t := f\ t f satisfy HlV/t^Koo < Le~ Kt . Given x, y G suppm, fix 
r > and apply (3.2) to find a geodesic test plan 7r connecting m(B r (x))~ 1 m\^B r (x) to 
m(B r (i/)) _1 raL5 r (!/); the weak upper gradient property then gives 



ft dm — + ft dm 

B r (x) JB r {y) 



< 



|V/ t | U) (7 s )|7 s | ds d.7r(7). 



Now, since (et)i^ ^ m we can use (6.6) to estimate the right hand side as follows: 

/ / t dttt- / ft dm < (2r+d(x,y))e-^sup{(H t |V/| 2 ) 1/2 (z) : d(z, x) < 2r + d(x, y)\ . 

JB r (x) JB r {y) L J 

We can now let r — > to get 

\ft(x) - ft(y)\ <d(x,y)e-^sup{(H i |V/U) 1/2 (z) : d(z, x) < 2d(x, y)} . 

This provides the Lipschitz estimate on f t , the upper gradient property and (6.7). Finally, 
choosing a sequence (U) I such that f tl — > f m-a.e. we obtain a set Y C suppm of full 
m-measure such that f\y is L-Lipschitz; / is any L-Lipschitz extension of f\y to X. □ 



Remark 6.3 (Weak Bochner inequality) Following verbatim the proof in [19, Theorem 4.6], 
relative to the Alexandrov case, one can use the Leibnitz rule of Theorem 4.18 and the Bakry- 
Emery estimate to prove Bochner's inequality (in the case ./V = oo) 

iA(V/ • V/) - V(A/) • V/ > KVf ■ V/. 

in a weak form. Precisely, for all / G D(A) with Af £ W 1,2 {X, d,m) and all g £ D(A) 
bounded and nonnegative, with Ag G L°°(X,m), it holds: 

\ [ A 5 |V/| 2 dm- ! gV(Af)-Vfdm>K f 5 |V/| 2 dm. 
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Remark 6.4 (Lipschitz continuity of Htf and J^t(fi)) If we assume the stronger L 1 t— > 
LP regularization property 

||H t /||p<C(*)||/||i for every / G L 2 (X,m), t>0 (6.10) 

for some p > 1, then we can improve (6.6) to a pointwise inequality as follows: 

|V(H t /)| 2 < e~ 2Kt H t {\V f\ 2 w ) in suppm, for all / G L 2 (X,m). 

Indeed, we can first use (6.10) to get, by approximation, ||pt[x]||p < C(t) for all x G suppm. 
Using the Young inequality for linear semigroups, this gives the implication 

||H t /||,. < C(t)\\f\\ q whenever \ + 1 < - + -. 

q* q p 

Then, choosing N > 1 so large that p > N/(N — 1), by iterating this estimate ./V times the 
semigroup property yields the L 1 i-> L°° regularization 

sup |HJ| < (C(t/JV)H/lli V/ G L\X,m). (6.11) 

supp m 

Now we can apply the first part of Theorem 6.2 to u s := Ht- S f, whose minimal weak upper 
gradient is in L°°(X,m), to obtain that e" Xs (H s (| Vu s | 2 )) 1/2 is an upper gradient of Htf and 
then pass to the limit as s J, to obtain that (Ht(| V/l^)) 1 / 2 is an upper gradient of Htf on 
suppm. Using the length property as in the proof of Theorem 6.2, from this estimate the 
bound on the slope of Htf follows. 

In particular we obtain that Htf is Lipschitz on suppm for all t > 0. Using again the 
inequality ||p*[a;]||oo < C(t) for all x G suppm and the semigroup property we obtain that also 
J%t(fj.) has a Lipschitz density for all fi G &2(X, m). 

The stronger regularizing property (6.10) is known to be true, for instance, if doubling 
and Poincare hold in (X, d,m), see [37, Corollary 4.2]. Notice also that Theorem 6.8 below 
ensures in any case the Lipschitz regularization, starting from bounded functions. ■ 

6.2 Dirichlet form and Brownian motion 

In this section we fix a RCD(K, oo) space (X, d,m). Recalling that the associated Cheeger's 
energy is a quadratic form, we will denote by £ the associated Dirichlet form as in Section 4.3. 
In particular Ch satisfies all the properties stated in Theorem 4.18. 

Notice also that (see for instance [16, Theorem 5.2.3]) it is not difficult to compute [/] in 
terms of Hj or in terms of 3%t{5 x ) by 

[/] = l }$Yt {f2+Htf2 ~ 2fHtf) ' [fm = l i™Yt I J(f( x )-f(y))My)dM(s x )(y)dm( y ). 

(6.12) 

A direct application of the theory of Dirichlet forms yields the existence of a Brownian 
motion in (X, d,m) with continuous sample paths. Continuity of sample paths depends on a 
locality property, which in our context holds in a particularly strong form, see (4.21). 

Theorem 6.5 (Brownian motion) Let (X,d,m) be a RCD(K, oo) space. There exists a 
unique (in law) Markov process {X.t}{t>o} ^ n (suppm, d) with continuous sample paths in 
[0, oo) and transition probabilities J%t(5 x ), i.e. 

P(X s+ t G A\X S = x) = J%(5 X )(A) Vs, t > 0, A Borel (6.13) 

form-a.e. x G suppm. 
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Proof. Uniqueness in law is obvious, since all finite-dimensional distributions are uniquely 
determined by (6.13), (6.4) and the Markov property. 

First, in the case when (X, d) is not locally compact, we prove a tightness property 
arguing exactly as in [4, Theorem 1.2], [28, Proposition IV. 4. 2] (the construction therein 
uses only distance functions and the inequality [d(-,x)] < m) to prove a tightness property, 
namely the existence of a nondecreasing sequence of compact sets F n C suppm satisfying 
cap £ (suppm \F n ) — > (here cap £ is the capacity associated to £). 

Since £ is a strongly local Dirichlet form, and Lipschitz functions are dense in D(£.) for 
the W 1,2 norm (Proposition 4.10) we may apply [16, Theorem 4.5.3] in the locally compact 
case or [28, Theorem IV. 3. 5, Theorem VI. 5] in the general case to obtain a Markov family 
{P x } xes u P pm of probability measures in C([0, oo); X) satisfying 



As a further step we consider the distance induced by the bilinear form £ 

d E (x,y) := sup{\g(x) - g(y)\ : g G £>(£), [g] < m} V(x, y) G suppm x suppm, (6.14) 

which we identify in Theorem 6.7 with d (the function g is the continuous representative in 
the Lebesgue class of g, see Theorem 6.2). 

Remark 6.6 In [22] the techniques of [19, 2] are applied to a case slightly different than 
the one considered here. The starting point of [22] is a Dirichlet form £ on a measure 
space (X, m) and X is endowed with the distance dg. Assuming compactness of (X, dg), 
.fT-geodesic convexity of Ent m in &2{X) with cost function c = d|, doubling, weak (1,2)- 
Poincare inequality and the validity of the so-called Newtonian property, the authors prove 
that the L 2 (X, m) heat flow induced by £ coincides with J%t. The authors also analyze some 
consequences of this identification, as Bakry-Emery estimates and the short time asymptotic 
of the heat kernel (a theme discussed neither here nor in [19]). As a consequence of [22, 
Theorem 5.1] and [2, Theorem 9.3] the Dirichlet form coincides with the Cheeger energy of 
(X, d£,m) (because their flows coincide). This is a non trivial property, because as shown in 
[38], a Dirichlet form is not uniquely determined by its intrinsic distance, see also the next 
result. ■ 

Theorem 6.7 (Identification of dg and d) The function dg in (6.14) coincides with d on 
suppm x suppm. 

Proof. Choosing g(z) = d(z,x), since [g] = |V<?|^m < m we obtain immediately that 
dt{x,y) > d(x,y) on suppm x suppm. In order to prove the converse inequality we notice 
that [g] < m implies, by Theorem 6.2, that the continuous representative g has Lipschitz 
constant less than 1 in X, hence \g(x) — g(y)\ < d(x,y). □ 

We conclude this section with an example of application, following the ideas of [5], of 
the calculus tools developed in Section 4.3 combined with lower Ricci curvature bounds, in 
particular with the Bakry-Emery estimate (6.2). 





with m(iV) = 0. Then we can take the law P := J F x dm(x) in C([0, oo); V) and consider the 
canonical process X t (7) = ^(t) to obtain the result. □ 
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Theorem 6.8 (Lipschitz regularization) If f G L 2 (X, m) then V\tf £ -D(£) /or even/ 
t > and 

2I 2K (i)|VH t /|* < HJ 2 m-a.e. in X; (6.15) 
in particular, if f E L°°(X, m) £/ien H t / £ Lip(suppm) /or even/ f > with 



y/2h K (t) Lip(H t /) < H/lloo /or ever?/ t > 0. (6.16) 

Proof. Let us consider two bounded Lipschitz functions /, p with nonnegative, and let us 
set 

G(s):= J (rW) 2 rWdm, G(0) = J (H t f)\dm, G(t) = ^ / 2 H^dm. (6.17) 

It is easy to check that G is of class C 1 and, evaluating the derivative of G, we obtain thanks 
to (4.20) 

G'(s) = -£((H t _ s f) 2 ,H s p) -2 j H 4 _ s /AHi_ s /H s ^dm 

= -£((H t _ s /) 2 ,H sV3 ) +2£(rW, H t _ s /H^) dm = 2 J |V(rW)| 2 H^dm. 

Using the fact that Ht is selfadjoint and applying the Bakry-Emery estimate (6.6) we get 

G'(s) = 2 J H s (|V(H t _ s /)| 2 )^dm>2e 2 ^y |V(H t /)| 2 <^dm 

and an integration in time yields 

J (H t / 2 - (H t /) 2 - 2I 2i ^)|V(H i /)| 2 ^dm > 0. (6.18) 

Since p is arbitrary nonnegative, neglecting the term (H t /) 2 we get the bound (6.15). We 
can now use Theorem 6.2 to obtain (6.16). □ 

By duality one immediately gets: 
Corollary 6.9 (Wi-L 1 regularization) For every x, y £ suppm and t > we have 

Vh K (t) J \ Pt [x](z) - p t [y](z)\dm(z) <d(x,y). (6.19) 

More generally, the map : i — >■ dJ^(/i)/dm satisfies 

y/hK(t) ||h t M-h t i/|| L i (JC|m) < Wi((i,v). (6.20) 

6.3 Stability 

Here we prove that the Riemannian Ricci curvature bounds are stable w.r.t. D-convergence. 
Notice that we will prove this by showing that condition (Hi) of Theorem 5.1, namely the 
EVI property, is stable w.r.t. D-convergence. 
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Theorem 6.10 (Stability) Let (X n ,d n ,m n ) G X, n G N, be RCD(K, oo) spaces. If 

lim D((X n ,d n ,m n ),(X,d,m)) =0, 

n—s-oo 

t/ien (X, d,m) is a RCD(K, oo) space as u>eZZ. 

Proof. We pass to the limit in (in) of Theorem 5.1. By Proposition 2.21 it is sufficient to 
prove that for any measure p = pm with p G L°°(X, m), there exists a continuous curve (/^) 
on [0, oo) starting from p which is locally absolutely continuous on (0, oo) and satisfies 

e K(s-t) x 

W%(p, s , v) - -Wi(p t , v) + l K {s - t)Ent m (p s ) < l K (s - t)Ent m (u) Vi < s, (6.21) 

for any v G &2(X) with bounded density. Let C := ||p||ooj choose optimal couplings 
(d n ,7 n ) G Opt((d,tn), (d n ,tn n )) and define p n := (j n )$p G ^ 2 (X n ). Since (X n ,d n ,m n ) 
is a RCD(K, oo) space, we know that there exists a curve t i— )• G ^2(^"n) starting from 
/j" such that 

^I-^WKm?, ^ n ) - v n ) + Ix(s - t) EntmCM?) < M« - i) Ent m (^) Vt < s, 

(6.22) 

where i/ n := (7 n )()i y . By the maximum principle (Proposition 2.18) we get p™ < Cm n for any 
n, t. Also, the energy dissipation equality (2.26) yields that 

^ J* \p?\ 2 dr < Ent ro «(/x") < ClogC, (6.23) 

so that the curves (pf) are equi-absolutely continuous. 

Now, define pf := (7 n )7 G S^2(X) for any n, t and notice that by (i) of Proposition 2.2 
we have p™ < Cm for any n, t. 

We claim that the set of measures in &2(X) which are absolutely continuous w.r.t. m 
and with density bounded bounded above by C is compact w.r.t. W 2 - Indeed the measure m 
is tight and, since it has finite second moment, also 2-uniformly integrable. Thus the same is 
true for the set of measures less than Cm, which is therefore compact (see [3] Section 5.1 for 
the relevant definitions and properties). 

By a diagonal argument we obtain a subsequence f 00 such that p% k — > pt in 
{&2{X),W2) as k — > oo for any t G Qfl [0, oo) and some pt G £P 2 (X). The equi-absolute 
continuity of the (/u")'s granted by (6.23), the uniform bound on the densities and the equicon- 
tinuity of ("fn)^ 1 ((H) of Proposition 2.2) grant that there is convergence for all times to a 
limit curve (pt) C &2(X) which is absolutely continuous as well. 

To conclude, notice that by (2.11) we have W2 (p™ k , v nh ) — > W2(pt,v) for any t G [0, 00), 
that lower semicontinuity and marginal monotonicity of the entropy yield 

Ent m (p t ) < liminf Ent m (/}]>) < Ent m (p™ k ) 

and Ent mn (z/ ra ) < Ent m (f). Thus, we can pass to the limit in (6.22) to get (6.21). □ 

We remark that it looks much harder to pass to the limit in (ii) of Theorem 5.1, because in 
general we gain no information about convergence of Cheeger's energies by the D-convergence 
of the spaces. To see why, just observe that in [39] it has been proved that any space 
(X, d , m) G X can be D-approximated by a sequence of finite spaces and that in these spaces 
Cheeger's energy is trivially null. 
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6.4 Tensorization 

In this section we shall prove the following tensorization property of RCD(K, oo) spaces: 



Theorem 6.11 (Tensorization) Let (X, dx,mx), (^dy,my) G X and define the product 
space (Z, d,m) G X as Z := X x Y, m := mx x my and 



Assume that both (X, dx,mx) and (Y, dy,my) are RCD(K, oo) and nonbranching. Then 
(Z, d,m) is RCD(K, oo) and non branching as well. 

The proof of this result is not elementary. Before turning to the details, we comment on 
the statement of the theorem: the non branching assumption is needed in particular because, 
up to now, it is not known whether the CD(K, oo) tensorizes or not: what is known is that the 
product of two nonbranching CD(K, oo) spaces is CD(K, oo) [39, Proposition 4.16]. Thus, 
the result follows combining this tensorization property with another tensorization property 
at the level of Cheeger's energies, proved in Theorem 6.17, that ensures that Cheeger's energy 
in Z is a quadratic form. Finally we use the nonbranching assumption once more to show 
that (Z, d) is nonbranching as well and therefore strong CD(K, oo) holds. 

Throughout this section we assume that the base spaces (X, dx , nix), (^dy,my) are 
RCD(K, oo), even though for the proof some intermediate results suffice weaker assumptions. 

Keeping the notation of Theorem 6.11 in mind, given / : Z — > M we shall denote f x the 
function f(x, •) and by f y the function /(•, y). Having in mind Beppo-Levi's pioneering paper 
[24], we denote by BL 1 ' 2 (Z, d, m) the space of functions / G L 2 (Z,m) satisfying: 



is well defined and belongs to L 2 (Z,m). 

Accordingly, we shall denote by Ch c : L 2 (Z,m) — > [0, oo) the quadratic form associated to 
|V/| C , namely 



if / G BL l,2 (Z, d, m), +oo otherwise. It is not hard to show that the two terms which 
define Ch c are L 2 (Z, m)-lower semicontinuous, which implies in particular that Ch c is lower 
semicontinuous: indeed, considering for instance J Ch Y (f x ) dm(x), suffices to check the lower 
semicontinuity on (fastly) converging sequences satisfying J2 n II/™ — /III < 00 • By Fubini's 
theorem these sequences satisfy J2 n ll/n — /^lli dm^) < oo, so that /" — > f x in L 2 (Y, my) 
for m-a.e. x £ X; then, the lower semicontinuity of Cheeger's functional Cb Y in the base 
space Y and Fatou's lemma provide the lower semicontinuity (the same argument applies to 




(a) f x G D(CU Y ) for m x -a.e. i£l and f y G D(Ch x ) for my-a.e. y G X. 

(b) \Vfv\l (x) G L\Z,m) and \Vf x \ 2 w (y) G L\Z, m ). 
For any / G BL 1,2 (Z, d, m) the cartesian gradient 



\Vf\c(x,y) := s/WFW) + WFW) 




fCh*(^)dm(y)). 
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Lemma 6.12 If f G Lip(Z) then \Vf\ w < a/|V/ x | 2 + \Vf y \ 2 m-a.e. in Z. In particular 



|V/U < ^<? 2 + <?1 m-o-e. *n Z (6.24) 

whenever g±, g<i : Z — > R are bounded Borel functions such that gi(x, •) is a upper semicon- 
tinuous upper gradient of f x and g2(-,y) is a upper semicontinuous upper gradient of f y . 

Proof. We will prove that the cartesian slope \/\^7f x \ 2 + |V/ y | 2 is a weak upper gradient 
for Lipschitz functions /. If 7 = (7,7) £ AC 2 ([0, 1]; Z) we need to prove that for a.e. t the 
inequality 

o 7)1 (t) < V|VK| 2 (7D + |V/^| 2 (7f ) \/\% X I 2 + l7 t y ! 2 (6-25) 

holds. A pointwise proof of this inequality seems not to be easy, on the other hand, working 
at the level of distributional derivatives, in [3, Lemma 4.3.4] it is proved that a.e. in [0, 1] it 
holds 

,d,_ \f{ll h ,lI)-f{l?,lY)\ , .. 1/(7^7^-/(7^)1 
Itt(/°7)IW <hmsup hhmsup , 

so that 

|^(/°7)|(<) < |V/ 7 * y |(7f)l7f \ + Wf^\hY)\iY\ a.e. in [0,1], 

from which (6.25) readily follows. The estimate (6.24) follows noticing that any upper semi- 
continuous upper gradient bounds the slope from below. □ 

In the next lemma we will improve the inequality \Vf\ w < ^\Vf x \ 2 + \V f y \ 2 obtaining 
|V/| C in the right hand side. To this aim we consider, as a regularizing operator, the product 
semigroup H£ in L 2 {Z,m), pointwise defined by 

H t 7(z,i/):= J J f(x',y')pf[x}(x')pY[y](y')d mx (x')dm Y (y') (6.26) 

where pf[x]{x') and pj[y](y') are the transition probability densities in the base spaces (see 
also (6.28) below for an equivalent description in terms of iterated operators). It is easy 
to show that Hf retains the same properties of its "factors" H^, in the base spaces, in 
particular it is mass preserving, self-adjoint, satisfies the maximum principle, regularizes from 
L°°(Z, m) to Cj)(Z) and leaves Lip(Z) invariant. In addition, H( can also be viewed as the 
L 2 (Z, m)-gradient flow of Ch c , namely the solution to 

^ft = A c /t (6-27) 

where the linear operator A c is defined in terms of the Laplacians in the base spaces Ax , Ay 
by A7(x,y) := A x f y (x) + A Y f x (y). 

Lemma 6.13 For all f £ Lip(Z) it holds \Vf\ w < |V/| C m-a.e. in Z. 

Proof. Set F\(x,y) := \Vf y \ w (x) and F2(x,y) := \V f x \ w (y). We consider the regularization 
ft := Hf / of /. Writing 

f t (x,y) = H?G(;y)(x) (6.28) 
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with G(x',y) := Hj f(x' , )(y), we can use first Theorem 6.2 and then the convexity of g i— >■ 
|V#|™ to get 

iVtfKz) < e-^(Hf|VG(-,y)|^) 1/2 (x) 
< e-«(HfHrif) 1/a (x,y) 
= e-^(H^ 2 ) 1/2 (x, y ). 

Analogously, reversing the role of the variables we get 

|V/f|(y)<e-^(H^|) 1/2 . 

So, we may take gi(x,y) := e'^HjF 2 ) 1 / 2 and 52 := e _i!rt (Hji ? f ) x / 2 in Lemma 6.12 to get 

|V/ t | 2 < e~ 2Kt H c t (F? + F 2 ) = H^|V/| 2 m-a.e. in Z. 

Letting t 1 the stability property of weak upper gradients and the strong continuity of the 
semigroup provide the result. □ 

The proof of the converse inequality is more involved. It rests mainly in an improvement 
in product spaces of the Hamilton- Jacobi inequality satisfied by the Hopf-Lax semigroup (see 
Lemma 6.14 below) and on its consequence, an improved metric derivative that we obtain 
in Lemma 6.15 along solutions to the L 2 (Z, m)-gradient flow of Ch c defined in (6.26) or, 
equivalently, in (6.27). 

In [2, Section 3], a very detailed analysis of the differentiability properties of the Hopf-Lax 
semigroup 

Q t g(w):= mi g(w') + ^-d 2 w (w',w) (6.29) 
in a metric space (W, dw) has been made. The analysis is based on the quantities 
Dg(w,t) := suplimsup dw(w, w' ), D~(w,t) := inf liminf dw(w,w n ), 

where the supremum and the infimum run among all minimizing sequences (w n ) in (6.29). 
These quantities reduce respectively to the maximum and minimum distance from w of min- 
imizers in the locally compact case. Confining for simplicity our discussion to the case of 
bounded functions, which suffices for our purposes, it has been shown that and D~ 
are respectively upper and lower semicontinuous in W x (0, 00), that D~(-,t)/t is an upper 
gradient of Qtg and that the following pointwise equality holds: 

d+ (D+(w,t)) 2 

^QtaW + 9 y = 0, (6.30) 

where we recall that d + /dt stands for right derivative (part of the statement is its existence 
at every point). Notice that, since D+(-,t)/t > D~(-,t)/t is an upper semicontinuous upper 

gradient of Qtg, (6.30) implies the Hamilton-Jacobi subsolution property ^-Qt9+\^Qtg\ 2 /2 < 
0, but in the sequel we shall need the sharper form (6.30). 

Lemma 6.14 Let g : Z — > R be a bounded function. Then, for all t > the function Qtg 
satisfies 

d + 1 

-^Qtg + ^Qt9\ 2 c <0 m-a.e. mZ. (6.31) 
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Proof. Taking (6.30) into account and the definition of \X7Qtg\ c (recall the notation f x (y) = 
f(x,y) = f y (x)), suffices to show that for all t > it holds 

[ ^ +(( ^ ),t)]2 > \V(Q t g)y\ 2 Jx) + \V(Q t g)y\l(x) m-a.e. in Z. (6.32) 

In order to prove (6.32), notice that we can minimize first in one variable and then in the 
other one to get 

(Qtg) v (x) = Qf{L t>y ){x), (Q t gT(y) = Qj(Rt, x )(y), (6.33) 

where L tjy (x') := Qjg(x',-)(y) and R t , x {y') ■= Q?g{-, y'){x)- Since D~(-,t)/t is is an upper 
gradient, we see that (6.32) is a consequence of the pointwise inequality 

[D+((x,y),t)\ 2 > [Dl t Jx,t)] 2 + [D Rt Jy,t)] 2 . (6.34) 

In order to prove (6.34), let us consider a minimizing sequence (x n ,y n ) for Q t g(x,y); since 

1 1 

Qtg(x,y) = lim g(x n ,y n ) + — dy(y n ,y) + — A 2 x {x n ,x) 

n— >oo It Zt 

> liminf L\(x n ) + ^-d 2 x (x n ,x) > Qf{L ty )(x) 

we can use (6.33) to obtain that all inequalities are equalities: this implies that the liminf is 
a limit and that and that (x n ) is a minimizing sequence for Qfip(x), with (p(x) = Lt !y (x). 
Analogously, (y n ) is a minimizing sequence for Qjip{y), where ip(y) = Rt lX (y)- Taking into 
account the definitions of D^, this yields (6.34). □ 

Lemma 6.15 (Kuwada's lemma in product spaces) Let f £ L°°(Z, m) be a probability 
density and let ft be the solution of the L 2 -gradient flow of Ch c starting from f. Then 
H t = f t m e ^ 2 pQ for all t > and 

lAtl 2 < / ^^dm fora.e.t>0. (6.35) 
J{ft>o] It 

Proof. The proof can be achieved following verbatim the proof of the analogous result [2, 
Lemma 6.1], this time working with |V/t| c in place of |V/t| w : this replacement is possible in 
view of the improved Hamilton- Jacobi inequality (6.31) and of the calculus rules 

- JgA c fdm< J |V/| c |V ff | c dm, -J ^(/)A c /dm = J ^(/)|V/|?dm, (6.36) 

which follow immediately by the analogous properties of the partial Laplacians. □ 

Proposition 6.16 We have D(Ch) C BL 1 ' 2 {Z, d, m). In addition, for all f G L>(Ch) there 
exist f n G -D(Ch c ) converging to f in L 2 (Z,m) and satisfying 

limsupCh c (/ n ) < Ch(/). (6.37) 
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Proof. We argue exactly as in [2, Theorem 6.2], where we identify weak upper gradients 
and relaxed gradients, the only difference being the use of the gradient flow of Ch c and the 
improved estimate (6.35). 

Pick / G D(Ch). With a truncation argument, we can assume that c _1 > / > c > 
m-almost everywhere in Z with f f 2 dm = 1. We consider the gradient flow (ht) of Ch c with 
initial datum h := f 2 , setting /j,t = htm, and we apply Lemma 6.15. The maximum principle 
yields c" 1 > ft > c and a standard argument based on (6.27) and (6.36) yields the energy 
dissipation identity 

ft log f t dm = - J ^Ms. dm. (6.38) 

Let g = h \Vh\ w , notice that by the chain rule we know that log/i is Sobolev along 
almost every curve and use the same argument of [2, Theorem 6.2] to get 



J (h log h — h t log ht) dm < J log h(h - h t ) dm < ( J J g 2 h s dmds^ ^ J \fi s \ 2 di 
Now, inequality (6.35) gives 



1/2 



1 



[hlogh- htloght) dm < -J Jg z h s dmds + -J \fi s \ 2 ds 

I ft r i rt p I 2 

Recalling the entropy dissipation formula (6.38) we obtain 



\Vh 12 



-dmds< / / g 2 h s dmds. 



Now, the chain rule and the identity g = 2f~ 1 \Vf\ w give J* C\^ c (^/h^) ds < f* f IV/I^/ -2 ^ dmds, 
so that dividing by t and passing to the limit as t J, we get (6.37), since \Jh~ s are equibounded 
and converge strongly to / in L 2 (Z, m) as s 4- 0. □ 



Theorem 6.17 Let f G L 2 (Z,m). Then f £ D(CU) if and only if f G ,D(Ch c ) and |V/|^ = 
|V/| C m-a.e. m Z. In particular Ch = Ch c is a quadratic form. 

Proof. By Proposition 6.16 we obtain that / G D(Ch) implies / G D(Ch c ) and Ch c (/) < 
Ch(/). If / G Lip(Z), Lemma 6.13 yields |V/| W < |V/| C m-a.e. in Z and the converse 
inequality Ch(/) < Ch c (/). It follows that the functionals and the gradients coincide in 
Lip(Z). Since Lip(Z) is a L 2 (Z, m)-dense and invariant subset for H£, for all / G D(Ch c ) 
we can apply Lemma 4.9 to obtain (/„) C Lip(Z) satisfying Ch c (/ — f n ) — > and we can 
pass to the limit as n — > oo in the inequality \Vf n \ w < |V/ n | c to get |V/| W < |V/| C . Hence, 
Ch(/) = Ch c (/) and the respective gradients coincide. □ 

Proof. (of Theorem 6.11) By [39, Proposition 4.16] we know that (Z, d,m) is CD(K, oo), 
while Theorem 6.17 ensures that Cheeger's energy in this space is a quadratic form. 

The proof that (Z, d,m) is nonbranching is simple, and we just sketch the argument. It 
is immediately seen that the non branching property is implied by the stability of constant 
speed geodesies under projections, namely if 7 = (7 ,"f Y ) G Geo(Z), then ^ x G Geo(X) and 
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7 £ Geo(y). This stability property can be shown as follows: in any metric space, constant 
speed geodesies are characterized by 



while for all other curves the inequality > holds. Since |7j| 2 = \^\ 2 + \ jY\ 2 wherever 
the metric derivatives of the components exist, we obtain J^\j^\ 2 dt = d^- (7^ , 7^) and 
Jq 1 |7tH 2 dt = dy(7^,7^), so that both j x and ^ Y are constant speed geodesies. 

Finally, we prove the strong CD(K, oo) property. Since the space is nonbranching, by 
Remark 3.2 it is sufficient to prove that it is a CD(K, oo) space. To prove this, we argue 
exactly as in [39, Lemma 4.7 and Proposition 4.16], taking into account the tightness of the 
sublevels of Ent m to remove the compactness assumption. We omit the details. □ 

6.5 Locality 

Here we study the locality properties of RCD(K, oo) spaces. As for the tensorization, we 
will adopt the point of view of the definition coming from the Dirichlet form, rather than 
the ones coming from the properties of the heat flow. The reason is simple. On one side, 
the heat flow does not localize at all: even on R rf to know how the heat flow behaves on 
the whole space gives little information about the behavior of the flow on a bounded region 
(we recall that, with our definitions, the heat flow that we consider reduces to the classical 
one with homogeneous Neumann boundary condition). On the other hand, Cheeger's energy 
comes out as a local object, and we will see that the analysis carried out in Section 4.3 and 
Section 6.2 will allow us to quickly derive the locality properties we are looking for. 

There are two questions we want to answer. The first one is: say that we have a 
RCD(K, oo) space and a convex subregion, can we say that this subregion - endowed with 
the restricted distance and measure - is a RCD(K, oo) space as well? The second one is: 
suppose that a space is covered by subregions, each one being a RCD(K, oo) space, can we 
say that the whole space is RCD(K, oo)? 

The first question has a simple answer: yes. The second one is more delicate, the problem 
coming from proving the convexity of the entropy. The analogous question for CD(K, oo) 
spaces has, as of today, two different answers. On one side there is Sturm's result [39, 
Theorem 4.17] saying that this local-to-global property holds if the space is nonbranching 
and the domain of the entropy is geodesically convex. On the other side there is Villani's 
result [43, Theorem 30.42]) which still requires the space to be nonbranching, but replaces 
the global convexity of the domain on the entropy, with a local one one, roughly speaking 
a (X, d,m) is finite-dimensional near to every point" (in a sense which we won't specify). 

Our answer to the local-to-global question in the RCD(K, oo) setting will be based on the 
following assumptions, besides the obvious one that the covering subregions are RCD(K, oo): 
the space is nonbranching and CD(K, oo), so that independently from the approach one has 
at disposal to prove the local to global for CD(K, oo), as soon as the space is nonbranching, 
RCD(K, oo) globalizes as well. 

We say that a subset Y of a metric space (X, d) is convex if, for any x, y G Y, there exists 
a geodesic 7 connecting x to y is contained in Y. 

Theorem 6.18 (Global to Local) Let (X, d,m) G X be a RCD(K, 00) space and let Y C 
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X be a closed convex set such that m(<9Y) = and m(Y) > 0. Then (Y, d,my) is a 
RCD(K, oo) space as well, where my := (m(F))" 1 mLF. 

Proof. Since Y is closed, (Y, d,my) G X. Let us first remark that for every \i G £P%(X) 

Ent my ( / u) < oo <^ supp^x C Y, Ent m (jU) < oo, (6.39) 

and in this case Ent my (/i) = cy + Ent m (^), where cy = log(m(Y)). Therefore, thanks to 
the RCD(K, oo) property of (X, d,m), the functional Ent my is if-geodesically convex on any 
Wasserstein geodesic (fj, s ) with supp /x s C Y for all s£ [0,1]. Such a geodesic exists since (Y, d) 
and thus (^(Y), Wb) are geodesic spaces: in particular, (Y, dy,my) is a strong CD(K, oo) 
space. Thus, to conclude we simply apply (m) of Theorem 4.19. □ 

The previous result is similar to the following lower Ricci curvature bound for weighted 
spaces: 

Proposition 6.19 (Weighted spaces) Let (X, d,m) G X be a RCD(K, oo) space and let 
V : X \— > M. be a continuous H-geodesically convex function bounded from below with f e~ v dm = 
1. Then (X, d,e~ v m) is a RCD{K + H, oo) space. 

The proof follows by the same arguments, applying [39, Proposition 4.14] (showing that 
(X, d,e~ v m) is a strong CD(K + H,oo) space), [2, Lemma 4.11] for the invariance of weak 
gradients with respect to the multiplicative perturbation, and (Hi) of Theorem 4.18. 
We conclude this section with the globalization result. 

Theorem 6.20 (Local to Global) Let (X,d,m) G X and let {Y}j g / be a cover of X made 
of finitely or countably many closed sets of positive m-measure, with m» := [m(Yj)] mLYj. 
Assume that the Cheeger functional associated to (Yj,d,trij) is quadratic (in particular when 
(Yj, d, xrii) is a RCD(K, oo) space) for every i £ L. Assume also that (X, d, m) is nonbranching 
and CD(K, oo). Then (X,d,vn) is a RCD(K, oo) space. 

Proof. We start proving that Ch is a quadratic form. Notice that it holds 

2Ch(/)= /" |V/|£dm = W |V/|*dm 

where := Y \ Uj<jYj. Let fi := f\ y and recall that by Theorem 4.19 we know that 
\Vfi\ w ,Yi = \^f\w m-a.e. on Y- Also, by Theorem 4.18 we have that for i and any Borel 
subset A of Y, the map / i— )• J A |V/|^ Y dm is quadratic. Choosing A = X{ the conclusion 
follows. 

The fact that (X, d,m) is a strong CD(K, oo) space follows from the fact that it is non- 
branching and CD(K, oo), as in Remark 3.2. □ 
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